M434 
DIFFERENTIAL 
GEOMETRY 


PART I 
CALCULUS ON EUCLIDEAN SPACE 


The Open University


Mathematics: A Fourth Level Course 
M434 Differential Geometry 


Part I Calculus on Euclidean Space
Prepared for the Course Team 
by Bob Margolis 


Set book 


Barrett O’Neill, Elementary Differential Geometry, hardback edition (Academic 
Press, 1966). 


It is essential to have this book; the course is based on it and will not make sense 
without it. 


The set book is referred to as O'Neill. 


The Open University, Walton Hall, Milton Keynes, MK7 6AA. 
First published as M334 1976. 
First published as M434 1993. Reprinted 1996. 


Copyright © 1993 The Open University 


All rights reserved. No part of this publication may be reproduced, stored in a retrieval system 
or transmitted in any form or by any means, without written permission from the publisher or a 
licence from the Copyright Licensing Agency Limited. Details of such licences (for reprographic 
reproduction) may be obtained from the Copyright Licensing Agency Ltd of 90 Tottenham Court 
Road, London, W1P 9HE. 


Edited, designed and typeset by the Open University using the Open University TEX System.
Printed in the United Kingdom by Hobbs the Printers Ltd, Totton, Hampshire SO40 3WX.
ISBN 0 7492 4770 3 


This text forms part of an Open University Fourth Level Course. If you would like a copy of 
Studying with The Open University, please write to the Central Enquiry Service, PO Box 200, 
The Open University, Walton Hall, Milton Keynes, MK7 6YZ. If you have not already enrolled 
on the Course and would like to buy this or other Open University material, please write to Open 
University Educational Enterprises Ltd, 12 Cofferidge Close, Stony Stratford, Milton Keynes, 
MK11 1BY, United Kingdom. 




Contents 


Introduction
1 Euclidean space
2 Tangent vectors
3 Directional derivatives
4 Curves
5 1-Forms
6 Differential forms
7 Mappings
8 Summary
Solutions to the exercises


Introduction 


This text, together with the first two sections of Part II, will cover most of the 
mathematical ideas and methods needed in this course for studying curves and sur- 
faces in three dimensions. Because of the concentration of methods here, unrelieved 
by any applications, you may find the work rather indigestible at first reading. We 
hope that a full appreciation of the significance of the ideas will develop as you 
apply them to curves and surfaces. This introduction is intended to give you some 
signposts to follow. 


The Introduction in O’Neill is brief, but important, as it introduces several of the 
notational conventions used in the course. It also gives a brief reminder of some 
ideas about functions from R to R. 


It might be as well to say here that in the course as a whole the word ‘function’, 
when used without further qualification, should be taken to mean a function from 
$E^3$ to R. That is, saying ‘f is a function’ indicates that

$$f : E^3 \to R.$$


Before we can hope to describe curves and surfaces, we need a precise description 
of the ‘three-dimensional space’ where they will be located. You have already met 
the vector space $R^3$ as a mathematical version of the space in which we live, often
referred to as (three-dimensional) Euclidean space. In Section 1, O’Neill refers to
this space as $E^3$. This change of name is not perversity: as will emerge, $E^3$ is
actually rather more than $R^3$.


As mentioned in Part 0, in the historical note, we shall be using sets of axes (Dar- 
boux’ ‘moving frames’) attached to each point of the object (curve or surface) that 
we are studying. Actually, rather than axes, we shall place a set of basis vectors 
at each point. In order to formalize the notion of ‘vector placed at a point’ the 
concept of tangent vector is introduced in Section 2. Once we have the idea of a 
vector placed at a particular point, we shall generalize to placing vectors at each 
point of $E^3$ according to some rule. This gives rise to the notion of vector field.


Note: As in so much of mathematics, you have to be careful not to read too much 
into everyday words which are used in a new context. The ‘tangent’ in ‘tangent 
vector’ is a generalization of the usual meaning and does not necessarily imply that 
the vector is a tangent to a curve or surface in the usual sense of the word. In fact, 
when dealing with curves we shall have some tangent vectors which are tangents to 
the curve and some which are not. The new use of ‘tangent’ will seem strange at 
first, but you will become used to it with practice. 


The key feature of the ‘moving frames’ approach is the study of the rates of change 
of the basis vectors as we move around the object being studied. Thus, to make
any progress, we shall need careful definitions of what we mean by rates of change, 
or derivatives, of vectors. We do not attempt to deal with vectors immediately; we 
start with rates of change of functions. Here we run into the first new idea. The 
problem is that, in $E^3$, there are infinitely many ways to set off from a particular
point. The rate of change of a function may well depend not only on where you are 
but also in which direction you set off. Actually, we introduce a further feature: 
we also permit the rate of change to depend on how fast you set off, as well as 
in which direction. This is not actually a complication as it permits us to specify 
the direction and speed of setting off by a vector. The direction is specified by 
the direction of the vector, and the speed by the size of the vector. Taking all 
these factors into account leads to the concept of a directional derivative. Section 3 
defines directional derivatives and proves that they have the linearity and Leibniz 
(product rule) properties that all our derivatives will have. 


Section 4 introduces a formal definition of what is meant in the course by a curve. 
The definition is not quite what you might have expected; curves turn out to be 
functions rather than distorted lines in space. 


In Section 5 we return to the theme of directional derivatives of functions. As we
indicated above, the derivative of a function can be expected to depend on where we
are (specified by three coordinates) and the vector specifying direction and speed
of departure from that point (another three coordinates). Thus the directional
derivative will be a function of these six quantities. To give a compact notation for
such things, we introduce 1-forms.

Margin note on tangent vectors: These are the ‘based vectors’ of MST204.


Having introduced 1-forms, Section 6 explores some algebra using 1-forms. Addition 
and subtraction turn out to be straightforward; multiplication and differentiation 
are more involved. The ideas from this section will not be applied to anything for 
some time, so you may well want to skim this section now and come back to it 
again later. 


We are still, at this stage, some way short of the goal of definitions of derivatives
of sets of basis vectors. Section 7 makes a step forward by introducing functions

$$E^n \to E^m,$$

where n and m are usually 2 or 3. We refer to these as mappings. You will become
familiar with these new objects as you work through the course. Having introduced
mappings, we then investigate how they can be combined and differentiated.


Finally, Section 8 provides a very brief summary of this part. 


Study advice 


One possible breakdown into study weeks is the following. 

Week 1 O’Neill, Chapter I, Introduction and Sections 1, 2 and 3.
Week 2 O’Neill, Chapter I, Sections 4 and 5.
Week 3 O’Neill, Chapter I, Sections 6, 7 and 8.


This plan leaves one study week for the first two sections of Part II and the asso- 
ciated Tutor-marked Assignment. The suggestion above is exactly that—a sugges- 
tion. After working through the first couple of sections you will have a better idea 
of how fast you can study O’Neill. Please allow for the fact that the work becomes 
less closely connected with previous experience as you go on, and that your progress 
may become slower as a result.


Do not be tempted to skip exercises. Although there is much powerful theory 
developed in the course, we expect you to be able to apply the theory to specific 
examples, and regular practice will help you. We expect that a significant part of 
your study time will be taken up by the exercises. 


The exercises for a particular section may be taken from O'Neill, given in this text 
or both. Solutions for all exercises, regardless of origin, appear at the end of this 
text. Even if you feel that your solution is completely correct, please read ours; 
there may well be additional points made in the course of the solution. 


Exercises from O’Neill are given by page and number. 


Margin note: 1-forms will be used in Sections 5-8 of Part II.


1 Euclidean space 


Read O'Neill: Introduction and Chapter I, Section 1, pages 1-5.


Euclidean 3-space You may well have seen this denoted by $R^3$ rather than $E^3$.


Natural coordinate functions This use of x, y and z to represent functions may
well be new to you. However, the function view does correspond to how these
symbols are used in elementary mathematics. For example,


‘the value of x at the point (2,3) is 2’


should correspond to the sort of usage of x that is familiar. This phrase is effectively
explaining what the value of the function x is at a point in its domain. All that
O’Neill has done is to formalize this.
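
The function view can be made concrete in a few lines of Python. This is only a minimal sketch: a point of $E^3$ is modelled as a tuple of three floats, and the function `f` is an illustrative choice of ours, not one taken from O’Neill.

```python
# A point of E^3 is modelled here as a tuple (p1, p2, p3) of floats.

def x(p):
    """Natural coordinate function x: picks out the first coordinate of p."""
    return p[0]


def y(p):
    """Natural coordinate function y: picks out the second coordinate of p."""
    return p[1]


def z(p):
    """Natural coordinate function z: picks out the third coordinate of p."""
    return p[2]


def f(p):
    """Illustrative function f = x^2 * y - z, built from x, y and z."""
    return x(p) ** 2 * y(p) - z(p)


p = (2.0, 3.0, 1.0)
print(x(p))  # the value of the function x at the point p
print(f(p))  # the value of the function f at the point p
```

Writing f = x^2 y - z therefore makes sense as a statement about functions: each side is a rule that assigns a real number to every point.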


Differentiable In differential geometry we shall want our functions to behave 
‘smoothly’. If you consider a function f from R to R, it will have a graph without 
‘sharp points’ if we require that its derivative f’ exists everywhere. The existence 
of f’ on the domain of f ensures that the graph has a tangent everywhere. If we 
also require f’ to be continuous, we shall ensure that the tangent to the graph 
cannot make any sudden changes of direction. Going further, requiring f to have 
derivatives of all possible orders should ensure that the graph of f is ‘smooth’. The 
definition on page 4 generalizes this idea to $E^3$.


The comments on the definition (page 5) simply point out that we really need
to worry about f and its derivatives only on the region of $E^3$ in which we are
interested.


The final comment, that some of the concepts are ‘over-elaborate’ in one dimension,
is quite important. Actually, the differences between a number of the concepts
apparently vanish in the one-dimensional case. Thus, in some cases, generalizing 
a familiar idea about functions from R to R requires not one but several new 
definitions. 


Exercise 1.1 O'Neill, page 5, Exercise 1. 
Exercise 1.2 O’Neill, page 5, Exercise 2. 
Exercise 1.3 O’Neill, page 6, Exercise 3. 


Exercise 1.4 O’Neill, page 6, Exercise 4. Note that this exercise defines the composite of a function

$$h : E^3 \to R$$

with three functions

$$g_1, g_2, g_3 : E^3 \to R.$$

[Solutions on page 27]


Margin note: M203

Margin note: The region will always be an open subset of $E^3$.


2 Tangent vectors 


Read O'Neill: Chapter I, Section 2, pages 6-10.


Tangent vector The reason for the apparently curious choice of name for this 
concept will emerge when we study surfaces, where most of the tangent vectors 
will also be tangents (in the intuitive sense) to surfaces. For now, it is important 
to realize that a tangent vector may not be (geometrically) a tangent to anything 
obvious. The important thing is to visualize $v_p$ as a vector parallel to v, but with
its blunt end at the point p rather than at the origin.


We shall be studying geometric objects from the point of view of someone living on 
the object, with a personal set of axes. The concept of a tangent vector is exactly 
what we shall need to define axes at a particular point. 


We shall make another use of tangent vectors. A tangent vector $v_p$ defines a line
through p pointing in the direction of v by the equation

$$r = p + tv, \quad t \in R.$$


Tangent space This definition formalizes the idea of taking an egocentric view 
of the world! One way of looking at it is to consider the ‘global’ coordinate system 
as being imposed by an outsider. Then, if you are located at p, 


$$T_p(E^3)$$

represents your personal copy of $E^3$, with its ‘origin’ at p.


Vector field This concept will be used to formalize the idea of carrying a set of
axes around. Instead of actually moving, say, the x-axis around, we assume that
each point p in space has been provided with the unit vector $(1, 0, 0)_p$ to indicate
the direction of the (local) x-axis. The function

$$U_1 : p \mapsto (1, 0, 0)_p,$$

which provides the unit vector at each point, is an example of a vector field. Note
the difference between $U_1$, which is a function, and $U_1(p)$, which is a particular
tangent vector at a particular point.


Pointwise principle This principle is nothing more than the fact that to define 
a function you must specify its value at each point in the domain. 


Natural frame field This set of vector fields essentially provides each point in
$E^3$ with a copy of the standard basis. Lemma 2.5 says that, because a tangent
vector at any point can be expressed in terms of (1,0,0), (0,1,0) and (0,0,1) at
that point, a corresponding statement is true of a vector field.


We shall discuss frame fields in general in Part II; for now, the only frame field that
we shall use is the natural frame field. It is worth emphasizing that ‘the natural
frame field’ means the three vector fields $U_1$, $U_2$ and $U_3$.


If you think about how actually to specify a vector field, Lemma 2.5 becomes
natural, not to say inevitable. Specifying a vector field involves explaining which
tangent vector should be attached to the point

$$p = (p_1, p_2, p_3),$$

whatever the coordinates of p happen to be. Thus we would have to explain how
to calculate the components of v from the coordinates of p. That is, we have to
specify the functions that O’Neill calls $v_1$, $v_2$ and $v_3$.

Margin note: M203


The identity

$$V = v_1 U_1 + v_2 U_2 + v_3 U_3$$

explains the importance of the natural frame field: it forms a basis for vector fields
on $E^3$.


If you regard a vector field V as a way of persuading $E^3$ to grow a (straight) hair
at each point, then requiring V to be differentiable is to demand that the direction
and size of the hairs vary smoothly as you move from point to point of $E^3$.


Finally, a comment on $R^3$ versus $E^3$. We would regard $R^3$ as a three-dimensional,
real vector space and $E^3$ as that vector space, together with all its tangent vectors.


Exercise 2.1 O'Neill, page 10, Exercise 1 (part (a) only). 
Exercise 2.2 O’Neill, page 10, Exercise 2. 


Exercise 2.3 O’Neill, page 10, Exercise 3. Your answers should be expressed in
terms of x, y, z, $U_1$, $U_2$ and $U_3$.


Exercise 2.4 O’Neill, page 11, Exercise 5. 


Note: This exercise shows that the natural frame field is not the only possible
basis for vector fields on $E^3$. The phrase Euclidean coordinate functions was used
earlier for the $v_i$ to indicate that they were the coordinate functions corresponding
to using the natural frame field as a basis.


[Solutions on page 27] 


3 Directional derivatives 


Read O'Neill: Chapter I, Section 3, pages 11-14.


This section marks the start of the process of defining rates of change of objects
(functions, tangent vectors etc.) as we move around in $E^3$.


Directional derivatives There is a fundamental problem that arises in generalizing
the idea of derivative from one dimension to several (here three) dimensions.
When we want to find the rate of change of a function

$$f : R \to R$$

at a point $a \in R$, we look at the values of f at points ‘near’ a. A point in R near
a can be written a + h, where h is either positive or negative. The derivative of f
at a is found by calculating the limit of

$$\frac{f(a + h) - f(a)}{h}$$

as h tends to zero (provided that the limit exists).

Suppose now that we want the ‘derivative’ of a function

$$f : E^3 \to R.$$

A point ‘near’ p will be of the form p + v, where v is a vector. The number h in the
one-dimensional case has been replaced by the vector v. Since division by vectors
is not defined, we cannot simply translate the limit definition above into the new
situation.




Margin note: M203


We have to accept that the rate of change of f will depend not only on where we
are (the point p), but also on the direction in which we move away from p. Having
accepted this change from the one-dimensional case, we make a further change: we
allow the rate of change of f to depend on how ‘fast’ we set off from p, as well as
on the direction.


Now, the direction and speed with which we set off from p can conveniently be
defined by giving a tangent vector $v_p$ based at p, since p and v define a straight
line

$$r = p + tv, \quad t \in R,$$

where t = 0 corresponds to the point p.


We can now define a composite function from R to R, by calculating values of f
along this line:

$$t \mapsto p + tv \mapsto f(p + tv).$$

This composite is just a function on R, and so can be differentiated with respect
to t in the usual way. The value of this (ordinary) derivative at t = 0 is defined to
be the derivative of f with respect to the tangent vector $v_p$. The formal statement
of the above discussion is Definition 3.1.
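
The recipe just described (restrict f to the line p + tv, then differentiate at t = 0) can be sketched numerically in Python. This is an illustration only: the derivative is approximated by a central difference with a step `h` of our choosing, and the function f = x^2 y z, the point and the vectors are illustrative assumptions.

```python
def directional_derivative(f, p, v, h=1e-5):
    """Approximate v_p[f], the derivative of t -> f(p + t v) at t = 0.

    A central difference stands in for the exact derivative, so the
    result is only an approximation.
    """
    def along_line(t):
        # Value of f at the point p + t v on the straight line through p.
        return f(tuple(p_i + t * v_i for p_i, v_i in zip(p, v)))

    return (along_line(h) - along_line(-h)) / (2 * h)


def f(point):
    """Illustrative function f = x^2 * y * z."""
    x, y, z = point
    return x ** 2 * y * z


p = (1.0, 1.0, 0.0)
v = (1.0, 0.0, -3.0)
slow = directional_derivative(f, p, v)
# Doubling v keeps the direction but doubles the speed of departure.
fast = directional_derivative(f, p, (2.0, 0.0, -6.0))
print(slow, fast)
```

Here f(p + tv) = -3t(1 + t)^2, whose derivative at t = 0 is -3, so `slow` should be close to -3. Doubling the vector doubles the result, which illustrates the dependence on speed as well as direction.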


Note: If you prefer to denote differentiation by dashes, you may prefer the following
form of the definition:

$$v_p[f] = (f(p + tv))'(0).$$


There is a strong link between the newly defined directional derivative and the 
partial derivatives that were discussed in Part 0. 


Suppose that we look at the tangent vector

$$v_p = (1, 0, 0)_p.$$

The definition of $v_p[f]$ now gives

$$\begin{aligned}
v_p[f] &= (f(p + tv))'(0) \\
&= (f(p + t(1, 0, 0)))'(0) \\
&= (f(p_1 + t, p_2, p_3))'(0).
\end{aligned}$$

In the last line above, only the first coordinate, $p_1 + t$, is varying; the other two are
constant. In other words, the last line is

$$\frac{\partial f}{\partial x}$$

evaluated at t = 0, in other words at p.

Hence,

$$(1, 0, 0)_p[f] = \frac{\partial f}{\partial x}(p).$$

Similarly,

$$(0, 1, 0)_p[f] = \frac{\partial f}{\partial y}(p), \qquad (0, 0, 1)_p[f] = \frac{\partial f}{\partial z}(p).$$

Thus, the values of the partial derivatives at a point are the directional derivatives
with respect to

$$(1, 0, 0)_p, \quad (0, 1, 0)_p \quad \text{and} \quad (0, 0, 1)_p.$$

This link may well be what is meant by the comment in the paragraph immediately
following the definition on page 11 of O’Neill.

Margin note: The dash represents differentiation with respect to t.


The worked example following the definition shows how to calculate directional 
derivatives from first principles. However, Lemma 3.2 provides an alternative 
method which is of considerable importance. 


The proof of Lemma 3.2 uses a generalized form of the chain rule (composite rule)
for differentiating composite functions. You have met the rule for functions from
R to R, probably in the form

$$(f \circ g)' = f'(g) \times g'.$$

Margin note: M101 and M203

Here, we have a composite

$$t \mapsto (p_1 + tv_1, p_2 + tv_2, p_3 + tv_3) \mapsto f(p + tv).$$

If we define

$$g_1(t) = p_1 + tv_1, \quad g_2(t) = p_2 + tv_2, \quad g_3(t) = p_3 + tv_3,$$

then we are trying to differentiate the composite

$$f(g_1, g_2, g_3).$$

The version of the chain rule that applies in this case is

$$(f(g_1, g_2, g_3))' = \frac{\partial f}{\partial x}(g_1, g_2, g_3)\, g_1' + \frac{\partial f}{\partial y}(g_1, g_2, g_3)\, g_2' + \frac{\partial f}{\partial z}(g_1, g_2, g_3)\, g_3'.$$

Margin note: We shall make quite heavy use of this result. You should note it, and note also its similarity to the ‘single variable’ case.

It is this result that is applied to prove Lemma 3.2.
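
As a numerical illustration of what the chain rule delivers here, the following Python sketch compares the directional derivative computed from the definition with the sum of the components of v multiplied by the partial derivatives of f at p. The function f = x e^y + z^2, the point, the vector and the central-difference step are all illustrative assumptions.

```python
import math


def derivative_at_zero(g, h=1e-5):
    """Central-difference approximation to g'(0) for g : R -> R."""
    return (g(h) - g(-h)) / (2 * h)


def by_definition(f, p, v):
    """v_p[f] as the derivative at t = 0 of t -> f(p + t v)."""
    return derivative_at_zero(
        lambda t: f(tuple(p_i + t * v_i for p_i, v_i in zip(p, v)))
    )


def partial(f, p, i):
    """Partial derivative of f at p in the i-th coordinate (i = 0, 1, 2)."""
    e_i = tuple(1.0 if j == i else 0.0 for j in range(3))
    return by_definition(f, p, e_i)


def by_chain_rule(f, p, v):
    """v_p[f] as the sum of v_i times the i-th partial derivative at p."""
    return sum(v[i] * partial(f, p, i) for i in range(3))


def f(point):
    """Illustrative function f = x * e^y + z^2."""
    x, y, z = point
    return x * math.exp(y) + z ** 2


p = (1.0, 0.0, 2.0)
v = (2.0, -1.0, 3.0)
print(by_definition(f, p, v), by_chain_rule(f, p, v))
```

At this point the partial derivatives of f are 1, 1 and 4, so both values should be close to 2(1) - 1(1) + 3(4) = 13.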


Once the definition of directional derivative and a practical method of calculation
are established, O’Neill goes on to show that directional derivatives have the linearity
and Leibnizian (product) properties that you should be familiar with from
ordinary derivatives. You might like to compare parts 2 and 3 of Theorem 3.3
(O'Neill, page 12) with

$$(af + bg)' = af' + bg',$$

$$(fg)' = f'g + fg',$$

for functions f, g from R to R and $a, b \in R$.

There is no result in ‘ordinary’ calculus corresponding to Theorem 3.3(1) because
ordinary differentiation is always carried out in a standard direction and at standard
(unit) speed.


Note that the right-hand side of Theorem 3.3(3) is NOT

$$v_p[f]\, g + f\, v_p[g].$$

To see why not, consider what sort of object

$$v_p[fg]$$

is. The directional derivative of a function with respect to a particular tangent
vector is a real number. However,

$$v_p[f]\, g + f\, v_p[g]$$

is the sum of two terms, each of which is a function multiplied by a real number.
Thus

$$v_p[f]\, g + f\, v_p[g]$$

is a function, not a real number. On the other hand, evaluating the functions in
each term at p gives

$$v_p[f]\, g(p) + f(p)\, v_p[g],$$

which is a real number.

Margin note: This type of analysis will often help you to reconstruct such formulas from a general knowledge of the pattern for the Leibniz property.
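
The Leibniz property in its correct form, with g and f evaluated at p, can be checked numerically. The Python sketch below is only illustrative: the functions f = xy and g = z + x, the point and the vector are our own choices, and derivatives are approximated by central differences.

```python
def directional_derivative(f, p, v, h=1e-5):
    """Central-difference approximation to v_p[f] = (f(p + t v))'(0)."""
    def along_line(t):
        return f(tuple(p_i + t * v_i for p_i, v_i in zip(p, v)))

    return (along_line(h) - along_line(-h)) / (2 * h)


def f(point):
    """Illustrative function f = x * y."""
    x, y, z = point
    return x * y


def g(point):
    """Illustrative function g = z + x."""
    x, y, z = point
    return z + x


def fg(point):
    """The product function fg, defined pointwise."""
    return f(point) * g(point)


p = (1.0, 2.0, 3.0)
v = (1.0, 1.0, 1.0)

# Left-hand side: the directional derivative of the product.
lhs = directional_derivative(fg, p, v)
# Right-hand side: v_p[f] g(p) + f(p) v_p[g], a sum of real numbers.
rhs = directional_derivative(f, p, v) * g(p) + f(p) * directional_derivative(g, p, v)
print(lhs, rhs)
```

Every quantity on the right-hand side is a real number because g and f are evaluated at p; with these choices both sides should be close to 16.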




The next step that O’Neill takes is similar (but not identical) to the process of
going from the value of a derivative at a particular point to the idea of a derived
function. Thus, in elementary calculus, after finding that the derivative of

$$f = x^2$$

at 2 is 4, at 3 is 6 and so on, we go on to define the derived function

$$f' = 2x,$$

which gives a rule for calculating the derivative everywhere.


Our goal is to give an expression that will enable us to calculate the value of the 
directional derivative of a function with respect to any tangent vector at any point. 
In this section we go only part of the way towards this goal; the rest of the work is 
done in Section 5. 


Instead of considering the directional derivative of a function f with respect to
a single tangent vector, we now consider the directional derivatives with respect
to a whole collection of tangent vectors, one at each point of $E^3$. The collection
of tangent vectors is provided by a vector field V. If you inspect the paragraph
beginning half-way down page 13 carefully, you will see that V[f] is defined as a
composite function

$$p \mapsto V(p) \mapsto V(p)[f].$$

Since the domain of this composite is $E^3$ and the codomain is R, we have that:
the derivative of a function with respect to a vector field is a function.


The example that immediately follows the definition is extremely important. From
remarks that we made earlier,

$$(1, 0, 0)_p[f] = \frac{\partial f}{\partial x}(p).$$

Since $U_1(p) = (1, 0, 0)_p$, we have $U_1[f]$ defined as the composite

$$p \mapsto U_1(p) \mapsto (1, 0, 0)_p[f] = \frac{\partial f}{\partial x}(p).$$

Since the function defined by

$$p \mapsto \frac{\partial f}{\partial x}(p)$$

is, by definition, written

$$\frac{\partial f}{\partial x},$$

we have the result

$$U_1[f] = \frac{\partial f}{\partial x}$$

and two similar results for $U_2[f]$ and $U_3[f]$.


These results give a sound basis for thinking of the partial derivatives of a function
as being the derivatives in the x, y and z directions.


Now that O’Neill has made the definition of V[f], the next step is to show that
the newly defined derivative also has the linearity and Leibnizian properties. This
is the content of Corollary 3.4. Note that, this time, the Leibniz property
(Corollary 3.4(3)) is the exact analogue of

$$(fg)' = f'g + fg'.$$

The results of Corollary 3.4 provide the basis for calculating directional derivatives
with respect to vector fields. (Parts 1 and 2 are probably the most used.) The
following worked example may help to illustrate the point.

Margin note: Since x is a function, this way of defining f is permissible.


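Example Suppose that the function $f : E^3 \to R$ and the vector field V on $E^3$
are defined by:

$$f = xe^y, \quad V = x^2 U_1 + xz U_2 + z^3 U_3.$$

Then

$$\begin{aligned}
V[f] &= (x^2 U_1 + xz U_2 + z^3 U_3)[xe^y] \\
&= x^2 U_1[xe^y] + xz U_2[xe^y] + z^3 U_3[xe^y] && \text{(by Corollary 3.4(1))} \\
&= x^2 \frac{\partial}{\partial x}(xe^y) + xz \frac{\partial}{\partial y}(xe^y) + z^3 \frac{\partial}{\partial z}(xe^y) && \text{(using } U_1[f] = \partial f/\partial x \text{ etc.)} \\
&= x^2 e^y + xz \cdot xe^y + z^3 \times 0 \\
&= xe^y(x + xz).
\end{aligned}$$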
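
The worked example can be checked numerically by treating V[f] as the composite p to V(p) to V(p)[f]. The Python sketch below does this for f = xe^y, reading the vector field as V = x^2 U_1 + xz U_2 + z^3 U_3. The sample point and the central-difference approximation are our own assumptions. Since f does not depend on z, the third coordinate function of V has no effect on the result.

```python
import math


def directional_derivative(f, p, v, h=1e-5):
    """Central-difference approximation to v_p[f] = (f(p + t v))'(0)."""
    def along_line(t):
        return f(tuple(p_i + t * v_i for p_i, v_i in zip(p, v)))

    return (along_line(h) - along_line(-h)) / (2 * h)


def f(point):
    """The function f = x * e^y."""
    x, y, z = point
    return x * math.exp(y)


def V(point):
    """Vector part of V(p) for V = x^2 U1 + x z U2 + z^3 U3."""
    x, y, z = point
    return (x ** 2, x * z, z ** 3)


def V_of_f(point):
    """The function V[f]: differentiate f with respect to V(p) at p."""
    return directional_derivative(f, point, V(point))


def closed_form(point):
    """The closed form x e^y (x + x z) for V[f]."""
    x, y, z = point
    return x * math.exp(y) * (x + x * z)


p = (1.5, 0.2, -0.7)
print(V_of_f(p), closed_form(p))
```

The two printed values should agree to within the accuracy of the finite-difference approximation.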


The example in O’Neill preceding Remark 3.5 uses exactly the same technique:
application of linearity followed by

$$U_1[f] = \frac{\partial f}{\partial x},$$

etc.

You should note the contents of Remark 3.5, but the course texts will tend to use
the full form $v_p$ rather more than O’Neill does. In particular, we shall often retain
the full form, $v_p$, including the point of application p, whereas O’Neill tends to
retain only the vector part.


Exercise 3.1 O'Neill, page 14, Exercise 1. 
Exercise 3.2 O’Neill, page 15, Exercise 2. 


Exercise 3.3 The vector field V and function f are defined by

$$V = 2U_1 - U_2 + 3U_3, \quad f = e^x \cos y.$$

Calculate V[f].
Exercise 3.4 O’Neill, page 15, Exercise 3. 
Exercise 3.5 Suppose that the vector field V has Euclidean coordinate functions
$v_1$, $v_2$ and $v_3$, so that

$$V = v_1 U_1 + v_2 U_2 + v_3 U_3.$$

Find $V[x_i]$ for each coordinate function $x_i$, and hence show that

$$V = \sum_{i=1}^{3} V[x_i]\, U_i.$$

Exercise 3.6 If V and W are two vector fields such that

$$V[f] = W[f]$$

for every real-valued function f on $E^3$, show that V = W. (Hint: The result of the
last exercise may be useful.)


[Solutions on page 29]


4 Curves 


Read O'Neill: Chapter I, Section 4, pages 15-21.


This section formally defines curves, the first type of geometric object that
you will study in detail in the course, the other being surfaces.


Curve The definition of curve is, perhaps, not quite what you might have expected.
It is the function $\alpha$ which is the curve; the object in $E^3$ that you probably
visualize when thinking about a curve is referred to as the route of the curve. (The
route is the image set of the function $\alpha$.)


The requirement for the domain of a curve to be an interval, together with requiring
a curve to be differentiable, ensures that the route of a curve is connected (‘all in
one piece’).


Much of the language used to discuss curves is based on the idea of a moving point
tracing out the route of the curve. The choice of t in

$$\alpha : t \mapsto \alpha(t) = (\alpha_1(t), \alpha_2(t), \alpha_3(t))$$

suggests ‘time’ as the variable. We shall talk about ‘speed’ and ‘velocity’ in
connection with curves.


Example 4.2 Examples 4.2(1) and 4.2(2) in O'Neill are particularly important 
and will reappear from time to time. 


Example 4.2(2) The distance of a point in the xy-plane from the origin can be
found by evaluating the function

$$\sqrt{x^2 + y^2}$$

at the point in question. Thus

$$\left(\sqrt{x^2 + y^2}\right)(a \cos t, a \sin t, 0) = \sqrt{a^2 \cos^2 t + a^2 \sin^2 t} = a,$$

so all points on the route of the first curve are on a circle of radius a.


Example 4.2(3) The justification for the description of the route of the curve given
in O’Neill is contained in Exercise 4.1 below.


Example 4.2(5) ‘Brute-force’ plotting of points is only rarely helpful for curves in
$E^3$, partly due to the difficulty of sketching points on such routes on two-dimensional
paper.


Velocity of curve This section of O’Neill introduces the concept of velocity and 
gives it a geometric interpretation. The geometric link might be a little clearer if 
the two diagrams on pages 17 and 18 were interchanged: the second is before the 
limit is taken, the first is after. 


It is important to note that the velocity vector

$$(\alpha_1'(t), \alpha_2'(t), \alpha_3'(t))$$

is attached to the point $\alpha(t)$ on the route; it is a tangent vector in the sense that
we have recently defined.


There is also a link with the idea of vector field. The process of finding velocities
at all the points on the route of a curve amounts to defining a vector field

$$\alpha_1' U_1 + \alpha_2' U_2 + \alpha_3' U_3,$$

but only for points on the route of the curve.




The term ‘speed’ is mentioned in this section of O’Neill without explanation. The
definition is straightforward: it is the magnitude of the velocity. Thus, at $\alpha(t)$, the
speed is

$$\sqrt{(\alpha_1'(t))^2 + (\alpha_2'(t))^2 + (\alpha_3'(t))^2}.$$

Since ||v|| is used in this course to denote the size (magnitude) of the vector v, this
result can be written

$$\text{Speed} = \|\alpha'(t)\| = \sqrt{(\alpha_1'(t))^2 + (\alpha_2'(t))^2 + (\alpha_3'(t))^2}.$$
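
To see velocity and speed in action, here is a small Python sketch for a helix of the form (a cos t, a sin t, bt). The values a = 3 and b = 4, the helper names and the central-difference approximation of the derivatives are illustrative assumptions.

```python
import math


def helix(t, a=3.0, b=4.0):
    """A curve: a function from (an interval of) R into E^3."""
    return (a * math.cos(t), a * math.sin(t), b * t)


def velocity(curve, t, h=1e-5):
    """Approximate the vector part of the velocity of the curve at t.

    Each coordinate function is differentiated by a central difference.
    """
    ahead = curve(t + h)
    behind = curve(t - h)
    return tuple((a_i - b_i) / (2 * h) for a_i, b_i in zip(ahead, behind))


def speed(curve, t):
    """Speed = magnitude of the velocity."""
    return math.sqrt(sum(c ** 2 for c in velocity(curve, t)))


print(velocity(helix, 0.0))  # close to (0, 3, 4), attached to the point (3, 0, 0)
print(speed(helix, 1.0))     # close to 5
```

For this helix the exact speed is the square root of a^2 + b^2 at every t, which is 5 for the values chosen, so the curve has constant speed but not unit speed.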


Reparametrization This is an important technical process, although you would 
be forgiven for not seeing why at this point in the course! 


To give an indication of its importance, we must refer back to the method that we 
said would be used to study curves: each point will be provided with a ‘local’ basis 
of unit vectors and the change in this basis from place to place will give information 
about the shape of the curve. 


The velocity vector provides a vector at each point on the route of the curve, but it 
is not necessarily of unit length. If we could arrange that the length of the velocity 
vector were 1 at each point (that is, the curve had unit speed everywhere), we should 
have a start for our basis of unit vectors. The later work on reparametrization will 
show that we can always arrange for the route of a curve to be traversed at unit 
speed. 


This section makes a start by showing what happens to the velocity when the
parametrization is changed. Part II, Section 2, will complete the process by showing
how to change the parametrization to achieve unit speed.


The key step in Lemma 4.5 involves three applications of the usual chain rule
formula. If you inspect the result carefully, you will see that it can be written

$$\beta = \alpha \circ h \implies \beta' = \alpha'(h) \times h',$$

which is a direct analogue of the familiar form of the chain rule.
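
The effect of a change of parametrization on velocity can be checked numerically. In the Python sketch below the curve, the reparametrizing function h(s) = s^2 and the sample value of s are illustrative assumptions, and derivatives are approximated by central differences.

```python
import math


def alpha(t):
    """Illustrative curve: the helix (cos t, sin t, t)."""
    return (math.cos(t), math.sin(t), t)


def h(s):
    """Illustrative reparametrizing function h(s) = s^2."""
    return s ** 2


def h_prime(s):
    """Exact derivative of h."""
    return 2 * s


def beta(s):
    """The reparametrized curve beta = alpha composed with h."""
    return alpha(h(s))


def velocity(curve, t, step=1e-5):
    """Central-difference approximation to the velocity of a curve at t."""
    ahead = curve(t + step)
    behind = curve(t - step)
    return tuple((a_i - b_i) / (2 * step) for a_i, b_i in zip(ahead, behind))


s = 0.7
lhs = velocity(beta, s)                                    # beta'(s)
rhs = tuple(h_prime(s) * c for c in velocity(alpha, h(s)))  # h'(s) alpha'(h(s))
print(lhs)
print(rhs)
```

The two tuples should agree component by component: the velocity of the reparametrized curve is the old velocity at h(s), scaled by h'(s).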


Important: Lemma 4.6 is apparently there simply because we have invented a 
new sort of tangent vector—velocities of curves—and we can now use such tangent 
vectors to differentiate functions. In a sense this is so. However, Lemma 4.6 will 
be of vital importance when we come to study surfaces, so it is worth taking some 
time to appreciate exactly what it says. 


As O’Neill says on page 20, finding $\alpha'(t)[f]$ involves finding the rate of change of
f along the straight line through $\alpha(t)$ in the direction of $\alpha'(t)$. The lemma shows
that the rate of change of f along the tangent is the same as the rate of change
along the curve itself. In one way this is not surprising: the curve and the tangent
point in the same direction and the length of the tangent represents the speed of
progress along the curve.


Lemma 4.6 is of only theoretical interest at the moment. Later, however, we shall
encounter functions which are defined only on a curve or surface and not on the
whole of $E^3$. In such cases we cannot use the definition to find directional derivatives
because, given $v_p$, the function in question may not be defined along the whole of
the straight line p + tv. In such cases we use Lemma 4.6. If we can find a curve $\alpha$
such that

$$\alpha(0) = p, \quad \alpha'(0) = v_p,$$

then, for a function f, we have

$$v_p[f] = (f(\alpha(t)))'(0).$$

Paraphrasing the result of Lemma 4.6: in the definition of directional derivative,
the straight line p + tv can be replaced by any curve $\alpha$ through p with velocity $v_p$
at p.
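
A numerical check makes this paraphrase concrete. In the Python sketch below the function f = xy + z^2 and the helix through p = (1, 0, 0) with initial velocity (0, 1, 1) are illustrative assumptions, and the derivatives at t = 0 are approximated by central differences.

```python
import math


def derivative_at_zero(g, h=1e-5):
    """Central-difference approximation to g'(0) for g : R -> R."""
    return (g(h) - g(-h)) / (2 * h)


def f(point):
    """Illustrative function f = x * y + z^2."""
    x, y, z = point
    return x * y + z ** 2


p = (1.0, 0.0, 0.0)
v = (0.0, 1.0, 1.0)


def line(t):
    """The straight line t -> p + t v."""
    return tuple(p_i + t * v_i for p_i, v_i in zip(p, v))


def alpha(t):
    """A helix with alpha(0) = p and initial velocity v at p."""
    return (math.cos(t), math.sin(t), t)


along_line = derivative_at_zero(lambda t: f(line(t)))
along_curve = derivative_at_zero(lambda t: f(alpha(t)))
print(along_line, along_curve)
```

Along the line f takes the values t + t^2, and along the helix it takes the values cos t sin t + t^2. These are different functions of t, but both have derivative 1 at t = 0, so the two printed values should both be close to 1.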


Margin note: You may have used |v| in the past.

Margin note: Where, as here, it causes no confusion, we say ‘along the curve’ rather than insist on the more precise ‘along the route of the curve’.


The final remarks in this section of O’Neill are not of great importance for this 
course and you need read them only cursorily. 


Exercise 4.1 Let $\alpha : (0, \pi/2) \to E^3$ be the curve described in Example 4.2(3) of
O'Neill, that is,

$$\alpha(t) = (2\cos^2 t, \sin 2t, 2\sin t), \quad 0 < t < \pi/2.$$

(a) Find the velocity vector of $\alpha$ for arbitrary t and also for $t = \pi/4$.

(b) Justify O’Neill’s statements about the curve by doing the following.

(i) Show that $\alpha(t)$ has length 2 for all values of t. (This shows that the route
of $\alpha$ lies on the sphere of radius 2, centre the origin.)

(ii) Show that $\alpha(t)$ is always 1 unit from the point $(1, 0, 2\sin t)$. (This shows
that the route of $\alpha$ lies on the cylinder of radius 1 whose axis is the line
x = 1, y = 0.)

(c) Find the coordinate functions of the curve $\beta = \alpha(h)$, where h is the function
defined by

$$h(s) = \arcsin s, \quad 0 < s < 1.$$
Exercise 4.2 O'Neill, page 21, Exercise 4. 
Exercise 4.3 O'Neill, page 21, Exercise 5. Note that the phrase ‘find a straight
line ...’ means ‘write down the equation in the form $\alpha(t) = \ldots$ for a suitable
function $\alpha$’.


Exercise 4.4 O'Neill, page 21, Exercise 7. 


Exercise 4.5 O’Neill, page 22, Exercise 9. 
[Solutions on page 31] 


5 1-Forms 


Read O'Neill: Chapter I, Section 5, pages 22-25.


Erratum O'Neill, page 25, final line before the exercises:
for $\ldots (p_2^2 + 1)v_3$ read $\ldots (p_2^2 + 2)v_3$.


Do not worry if you have not met the definition of differential given at the start of 
this section. 


1-forms In Section 3 we mentioned that this section would complete the task of 
giving a formula for calculating the directional derivative of a function at any point 
with respect to any tangent vector. 


Consider, for a moment, what such a formula would have to look like. Given a
function $f$ and a tangent vector $\mathbf{v}_p$, the directional derivative would depend on the
three components $p_1$, $p_2$ and $p_3$ of $\mathbf{p}$ and on the three components $v_1$, $v_2$ and $v_3$
of $\mathbf{v}$.


15 


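For example, let

\[ f = xyz \quad\text{and}\quad \mathbf{v}_p = (v_1, v_2, v_3)_{(p_1, p_2, p_3)}; \]

then, working from Lemma 3.2,

\[
\begin{aligned}
\mathbf{v}_p[f] &= \frac{\partial f}{\partial x}(\mathbf{p})\,v_1 + \frac{\partial f}{\partial y}(\mathbf{p})\,v_2 + \frac{\partial f}{\partial z}(\mathbf{p})\,v_3 \\
&= (yz)(\mathbf{p})\,v_1 + (xz)(\mathbf{p})\,v_2 + (xy)(\mathbf{p})\,v_3 \\
&= p_2 p_3 v_1 + p_1 p_3 v_2 + p_1 p_2 v_3.
\end{aligned}
\]

Thus, as expected, $\mathbf{v}_p[f]$ is a ‘function of six variables’.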
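
The formula just obtained can be checked numerically. The following is a minimal sketch, not part of O'Neill or of this text: it estimates the derivative of $t \mapsto f(\mathbf{p} + t\mathbf{v})$ at $t = 0$ by a central difference and compares it with $p_2 p_3 v_1 + p_1 p_3 v_2 + p_1 p_2 v_3$. The function names, the step size `h` and the sample point and vector are all assumptions chosen for illustration.

```python
def f(x, y, z):
    """The function f = xyz from the worked example."""
    return x * y * z


def directional_derivative(func, p, v, h=1e-6):
    """Estimate (d/dt) func(p + t v) at t = 0 by a central difference."""
    forward = func(*(pi + h * vi for pi, vi in zip(p, v)))
    backward = func(*(pi - h * vi for pi, vi in zip(p, v)))
    return (forward - backward) / (2 * h)


def formula(p, v):
    """The closed form p2 p3 v1 + p1 p3 v2 + p1 p2 v3 obtained from Lemma 3.2."""
    p1, p2, p3 = p
    v1, v2, v3 = v
    return p2 * p3 * v1 + p1 * p3 * v2 + p1 * p2 * v3


# Illustrative values only; any point and tangent vector would do.
p = (1.0, 2.0, 3.0)
v = (0.5, -1.0, 2.0)
numeric = directional_derivative(f, p, v)
exact = formula(p, v)
print(numeric, exact)
```

The two values agree to within the accuracy of the finite difference, which is all that such a check can be expected to show.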


What we are aiming at is a formula (which will be called $df$) which will produce
the correct value of the directional derivative when $\mathbf{v}_p$ is correctly substituted.


We already know how to produce the $p_i$ from a point $\mathbf{p}$. We use the coordinate
functions $x$, $y$ and $z$. What we now need are functions to play a similar role in
extracting the $v_i$ from a tangent vector. That is precisely what is done by

\[ dx, \quad dy \quad\text{and}\quad dz. \]

For the function defined above, we have

\[ df = (yz)\,dx + (xz)\,dy + (xy)\,dz. \]


The concept of 1-form arises from the need to express directional derivatives con- 
cisely. However, they are not defined quite in the way that the need arises. 


Because (Theorem 3.3(1)) directional derivatives behave linearly with respect to the
involvement of the tangent vector, the expression that we arrive at for a general
directional derivative must involve the $v_i$ in a linear manner. Also, for a particular
tangent vector, the value of the directional derivative is a real number. Thus, once
we have defined it properly, $df$ will have to be a function whose domain is the set of
all possible tangent vectors, whose codomain is $\mathbf{R}$ and which is linear with respect
to tangent vectors at each point.


We have two properties of df: it is a function from tangent vectors to reals and, at 
any point p, it is linear. As is so often the case in mathematics, these two properties 
are extracted and made the definition of a new concept. Here the concept is that 
of a 1-form. The formal definition is given in Definition 5.1. 


Having given a definition of 1-forms (5.1) that does not mention directional deriva- 
tives, there is now the task of showing that 1-forms are the right concept for ex- 
pressing general ‘derivatives’. This is the point of Definition 5.2. 


We have been discussing a way of representing the function

\[ \mathbf{v}_p \mapsto \mathbf{v}_p[f] \]

as a ‘formula’. Definition 5.2 makes a start by giving a name, $df$, to

\[ \mathbf{v}_p \mapsto \mathbf{v}_p[f]. \]


A short argument is needed to show that df is a 1-form. (Actually, the definition 
of 1-form was carefully framed so as to make df a 1-form.) 


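Example 5.3(1) Having introduced a new concept, some simple examples are in
order. The simplest non-constant functions on $E^3$ are probably the coordinate
functions. It is worth expanding the working in Example 5.3 a little. Since $x$ is a
function, we can calculate

\[
\begin{aligned}
\mathbf{v}_p[x] &= \frac{\partial x}{\partial x}(\mathbf{p})\,v_1 + \frac{\partial x}{\partial y}(\mathbf{p})\,v_2 + \frac{\partial x}{\partial z}(\mathbf{p})\,v_3 \\
&= 1v_1 + 0v_2 + 0v_3 \\
&= v_1.
\end{aligned}
\]

Hence

\[ dx\colon \mathbf{v}_p \mapsto v_1. \]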


16 


If you check the example 
above, you will see that this 
is the case. 


Even though df is not 
properly defined yet! 


The argument is in the 
paragraph immediately after 
the definition. 


Similarly,

\[ dy\colon \mathbf{v}_p \mapsto v_2, \qquad dz\colon \mathbf{v}_p \mapsto v_3. \]

Thus, $dx$, $dy$ and $dz$ behave with respect to tangent vectors in exactly the same
way as $x$, $y$ and $z$ behave with respect to points.


You may wonder, with justification, what this use of $dx$ has to do with its use in
expressions such as

\[ \int x^2\,dx. \]

The expression after the integral sign is a 1-form (it is $df$, where $f = x^3/3$). If
finding directional derivatives of functions yields 1-forms, it is reasonable to think 
that integration might produce functions from 1-forms. Not only is this reasonable 
but, under suitable conditions, it is also true. Unfortunately, we do not have room 
to consider integration of 1-forms in this course, although O’Neill does mention the 
topic briefly in several places. 


Example 5.3(2) A new idea is inserted here without explicit mention. So far,
1-forms act on tangent vectors (and produce real numbers), but we can form
composites of 1-forms with vector fields. The case which is used in this example involves
$dx$, etc. with the natural frame field $U_1$, etc.


When we write $dx(U_1)$, we mean the composite function

\[ \mathbf{p} \mapsto U_1(\mathbf{p}) \mapsto dx(U_1(\mathbf{p})). \]

Since $U_1(\mathbf{p}) = (1, 0, 0)_p$ and $dx$ extracts the first component of a tangent vector,
we have

\[ dx(U_1(\mathbf{p})) = 1. \]

Similar calculations show that

\[
\begin{aligned}
dx(U_1(\mathbf{p})) &= 1, & dx(U_2(\mathbf{p})) &= 0, & dx(U_3(\mathbf{p})) &= 0, \\
dy(U_1(\mathbf{p})) &= 0, & dy(U_2(\mathbf{p})) &= 1, & dy(U_3(\mathbf{p})) &= 0, \\
dz(U_1(\mathbf{p})) &= 0, & dz(U_2(\mathbf{p})) &= 0, & dz(U_3(\mathbf{p})) &= 1.
\end{aligned}
\]

These nine results can be summed up in ‘functional’ form as

\[ dx_i(U_j) = \delta_{ij}. \]

The Kronecker delta, $\delta_{ij}$, is a standard symbol and will be used a number of times
in this course.
If we use the linearity properties of 1-forms and the results summarized as

\[ dx_i(U_j) = \delta_{ij}, \]

then evaluating 1-forms on vector fields becomes quite routine.


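Example Suppose that the vector field $V$ and the 1-form $\phi$ are defined by

\[ V = x^2 U_1 - zU_2 + yU_3 \quad\text{and}\quad \phi = y\,dx + x\,dy - z\,dz. \]

Then

\[
\begin{aligned}
\phi(V) &= (y\,dx + x\,dy - z\,dz)(x^2 U_1 - zU_2 + yU_3) \\
&= y\,dx(x^2 U_1 - zU_2 + yU_3) + x\,dy(x^2 U_1 - zU_2 + yU_3) \\
&\qquad - z\,dz(x^2 U_1 - zU_2 + yU_3) && \text{(linearity)} \\
&= y(x^2)\,dx(U_1) + y(-z)\,dx(U_2) + y(y)\,dx(U_3) + x(x^2)\,dy(U_1) \\
&\qquad + x(-z)\,dy(U_2) + x(y)\,dy(U_3) - z(x^2)\,dz(U_1) - z(-z)\,dz(U_2) \\
&\qquad - z(y)\,dz(U_3) && \text{(linearity)} \\
&= (x^2 y + 0 + 0) + (0 - xz + 0) + (0 + 0 - zy) && \text{(using } dx_i(U_j) = \delta_{ij}\text{)} \\
&= x^2 y - xz - yz.
\end{aligned}
\]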
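
The routine nature of this calculation can be seen in a few lines of code. The sketch below is an illustration only, reading the example as $V = x^2U_1 - zU_2 + yU_3$ and $\phi = y\,dx + x\,dy - z\,dz$; the helper `evaluate_one_form` and the sample point are assumptions introduced here. It uses nothing beyond linearity and $dx_i(U_j) = \delta_{ij}$, so that $(f_1\,dx + f_2\,dy + f_3\,dz)(v_1U_1 + v_2U_2 + v_3U_3) = f_1v_1 + f_2v_2 + f_3v_3$ at each point.

```python
def evaluate_one_form(coefficients, field, p):
    """Value at p of (f1 dx + f2 dy + f3 dz)(v1 U1 + v2 U2 + v3 U3).

    By linearity and dx_i(U_j) = delta_ij this is f1*v1 + f2*v2 + f3*v3,
    with every function evaluated at the point p.
    """
    return sum(f(*p) * v(*p) for f, v in zip(coefficients, field))


# phi = y dx + x dy - z dz, stored as its three coefficient functions.
phi = (lambda x, y, z: y, lambda x, y, z: x, lambda x, y, z: -z)
# V = x^2 U1 - z U2 + y U3, stored as its three Euclidean coordinate functions.
V = (lambda x, y, z: x ** 2, lambda x, y, z: -z, lambda x, y, z: y)


def hand_result(x, y, z):
    """The function x^2 y - xz - yz found by hand in the example."""
    return x ** 2 * y - x * z - y * z


p = (2.0, 3.0, 5.0)  # an arbitrary sample point
value = evaluate_one_form(phi, V, p)
print(value, hand_result(*p))
```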


The definition of $\delta_{ij}$ is $\delta_{ij} = 1$
if $i = j$ and $\delta_{ij} = 0$ if $i \neq j$.


Having found that $dx$, $dy$ and $dz$ provide examples of 1-forms, O’Neill goes on to
show that these three are all the 1-forms that you need (at least for $E^3$). Lemma 5.4
shows that any 1-form can be expressed in terms of these ‘basic’ 1-forms.


Warning: Although differentials of functions provided the motivation for
introducing 1-forms and will also provide many of the examples of 1-forms, it is quite
easy to write down a 1-form which is not the differential of a function on $E^3$. For
example, the 1-form

\[ dx + z\,dy + z^2\,dz \]

cannot be $df$ for any $f$. To see why not, spend a few minutes trying to find $f$ such
that

\[ \frac{\partial f}{\partial x} = 1, \qquad \frac{\partial f}{\partial y} = z, \qquad \frac{\partial f}{\partial z} = z^2. \]

Calculating differentials The remark after Lemma 5.7 is useful. Using

\[ df = \frac{\partial f}{\partial x}\,dx + \frac{\partial f}{\partial y}\,dy + \frac{\partial f}{\partial z}\,dz \]

is not usually the quickest way of finding $df$, as the example illustrates.


Also illustrated by the example is the fact that, now differentials are available, it is
easier to find a directional derivative

\[ \mathbf{v}_p[f] \]

by finding $df$ and then using

\[ \mathbf{v}_p[f] = df(\mathbf{v}_p). \]
Exercise 5.1 O'Neill, page 25, Exercise 1. 
Exercise 5.2 O'Neill, page 25, Exercise 3. 
Exercise 5.3 O'Neill, page 26, Exercise 4. 
Exercise 5.4 O’Neill, page 26, Exercise 5 (first part only). 


Exercise 5.5 O'Neill, page 26, Exercise 7. 
[Solutions on page 32] 


6 Differential forms 


Read O'Neill: Chapter I, Section 6, pages 26-31.


Errata
1 O'Neill, page 31, Exercise 6, last line:
for $\ldots r, \phi, z \ldots$ read $\ldots r, \vartheta, z \ldots$.
2 O'Neill, page 29, line 8:
for $d\phi = d(xy)\,dx + d(x^2)\,dz$ read $d\phi = d(xy) \wedge dx + d(x^2) \wedge dz$.


O’Neill is following his convention of omitting the wedge when one term is $dx$, $dy$
or $dz$.


3 O'Neill, page 29, 4 lines from the bottom:
for $dp$ read $d\phi$.


This section of O’Neill contains no difficult computational ideas. The problem is, 
rather, why introduce 2-forms and 3-forms at all? Little use is made of 3-forms in 
the course, but 2-forms appear in the work on surfaces. 


2-forms Although it is not required for the exercises on this section, it seems 
reasonable to give some hints about what 2-forms are, in the same way as we have 
defined 1-forms to be real-valued functions on tangent vectors. 


If you have retained a feeling, in spite of the last section, that $dx$ has something
to do with a ‘small increment’ in the $x$-direction, then you might suspect that the
2-form $dx\,dy$ might have something to do with a ‘small area’. It is certainly the case
that some of the 2-forms that we shall meet are connected with areas, although not
all of them are.


Just as 1-forms operated on tangent vectors, 2-forms act on pairs of tangent vectors
based at the same point. The details will emerge later in the course; for now we
just give an example. The 2-form $dx\,dy$ acts on the pair of tangent vectors $(\mathbf{v}_p, \mathbf{w}_p)$
at $\mathbf{p}$ as follows:

\[ (dx\,dy)(\mathbf{v}_p, \mathbf{w}_p) = dx(\mathbf{v}_p)\,dy(\mathbf{w}_p) - dx(\mathbf{w}_p)\,dy(\mathbf{v}_p). \]

If we evaluate the terms, we obtain

\[ (dx\,dy)(\mathbf{v}_p, \mathbf{w}_p) = v_1 w_2 - w_1 v_2. \]

The other ‘basic’ 2-forms $dx\,dz$ and $dy\,dz$ behave in a similar way.
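
As a small illustration of the rule just given (a sketch only, with function names chosen here rather than taken from O'Neill), the action of $dx\,dy$ on a pair of tangent vectors can be written out directly. Only the vector parts matter, since both tangent vectors are based at the same point.

```python
def dx(v):
    """dx extracts the first component of the vector part."""
    return v[0]


def dy(v):
    """dy extracts the second component of the vector part."""
    return v[1]


def dx_dy(v, w):
    """The 2-form dx dy on a pair (v, w): dx(v) dy(w) - dx(w) dy(v)."""
    return dx(v) * dy(w) - dx(w) * dy(v)


v = (1, 2, 3)
w = (4, 5, 6)
print(dx_dy(v, w))   # v1 w2 - w1 v2
print(dx_dy(w, v))   # interchanging the vectors changes the sign
print(dx_dy(v, v))   # a repeated vector gives zero
```

The sign change under interchange of the two vectors is a property shared by the other basic 2-forms.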


The important thing at this stage is to be able to do calculations with differential 
forms. A more complete understanding of their meaning should emerge when we 
begin to use them later on. 


Exterior derivative You might find it helpful to paraphrase Definition 6.3 as: ‘to 
differentiate a 1-form, apply d to the coefficient functions’. 


It is clearly possible to extend Definition 6.3 to 2-forms as follows. 


Exterior derivative of 2-forms
If $\eta$ is the 2-form

\[ \eta = f_1\,dx\,dy + f_2\,dy\,dz + f_3\,dx\,dz, \]

then the exterior derivative of $\eta$ is the 3-form

\[ d\eta = df_1 \wedge dx\,dy + df_2 \wedge dy\,dz + df_3 \wedge dx\,dz. \]


Theorem 6.4 By now, the pattern used by O’Neill should be becoming familiar. 
Having defined a new form of differentiation (the exterior derivative), there follows 
a proof that the derivative obeys the linear and Leibniz rules that help in doing 
calculations. The point worthy of special note here is the form of Theorem 6.4(3) 
with its minus sign. 


Optional Comment: If you have met the concepts of ‘div’, ‘grad’ and ‘curl’, 
you may be interested in page 31, Exercise 8, which explains the link between 
differential forms and these ideas. 


For our purposes forms provide a very concise way of representing some of the ideas 
of differential geometry and we shall not use any of the other possible approaches 
from vector calculus. 

Exercise 6.1 O'Neill, page 31, Exercise 1. 

Exercise 6.2 O'Neill, page 31, Exercise 2. 

Exercise 6.3 O'Neill, page 31, Exercise 3. 


Exercise 6.4 O'Neill, page 31, Exercise 4. 


Another, classical, approach 
uses tensors. We shall avoid 


them as well! 


19 


Exercise 6.5 O'Neill, page 31, Exercise 7. (Hint: Use the fact that any 1-form $\phi$
can be written as

\[ \sum_{i=1}^{3} f_i\,dx_i \]

for suitable functions $f_i$.)


Exercise 6.6 O’Neill, page 32, Exercise 9. 
[Solutions on page 33] 


7 Mappings 


Read O'Neill: Chapter I, Section 7, pages 32-39.


Errata
1 O'Neill, page 36, second line after the proof of Theorem 7.5:

for $F_*(v)$, read $F_*(\mathbf{v})$,

that is, the vector $\mathbf{v}$ should be in bold type.
2 O'Neill, page 37, proof of Corollary 7.7:
for ‘in Corollary 7.6’ read ‘in Theorem 7.5’.


Mapping In this section, O’Neill generalizes the idea of a function

\[ f\colon E^3 \to \mathbf{R} \]

to that of a function

\[ F\colon E^n \to E^m, \]

and if $F$ is differentiable (in the sense defined on page 33) then such functions are
called mappings. Thus, both real-valued functions on $E^3$ and curves are special
cases of mappings.


We shall use mappings for two purposes in this course. First, we shall use a special 
type of mapping (isometries) in our work on curves. Secondly, a mapping between 
a ‘simple’ surface, such as a sphere, and a more complicated one will help in the 
study of the shape of the more complicated surface. For this second purpose, we 
shall need to know how a mapping ‘distorts’ geometric objects (such as curves) in 
its domain. 


Definition 7.1 This tells you how to specify a mapping: you give the coordinate 
functions. 


If you inspect the paragraph following the definition carefully, you will see that 
the ‘pointwise’ and ‘coordinate function’ forms for specifying a mapping are very 
little different; the function form is more concise. You should find little difficulty 
in switching from one form to the other. What this paragraph does re-emphasize 
is that $x$, $y$ and $z$ are functions, whereas $p_1$, $p_2$ and $p_3$ are real numbers.


The final paragraph before Definition 7.2 makes an important point. It is not easy 
to decide at a glance what effect a mapping has on its domain. Just as functions 
from $\mathbf{R}$ to $\mathbf{R}$ can be investigated by using derivatives, so we shall use ‘derivatives’
of mappings to help in understanding them. To begin with, however, we look at 
what a mapping does to the route of a curve. 


20 


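Example 7.3(1) In matrix form, with respect to the standard bases in domain
and codomain, $F$ can be written

\[
\begin{pmatrix} p_1 \\ p_2 \\ p_3 \end{pmatrix}
\mapsto
\begin{pmatrix} 1 & -1 & 0 \\ 1 & 1 & 0 \\ 0 & 0 & 2 \end{pmatrix}
\begin{pmatrix} p_1 \\ p_2 \\ p_3 \end{pmatrix}.
\]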


Example 7.3(2) There is no reason why you should have thought of the idea of
considering the image of the circle $\alpha$. However, it does give some insight into what
the mapping does.


Definition 7.4 This is the definition of a derivative for mappings mentioned above.
If we write out explicitly the calculation involved, we can see that

\[ F_*(\mathbf{v}_p) = (F(\mathbf{p} + t\mathbf{v}))'(0)_{F(\mathbf{p})}. \]

In this form, it is easier to see the family connection with the definition of directional
derivative of a function on $E^3$. Indeed, directional derivatives can now be seen as
a special case of derivative maps.


It is worth emphasizing that
$F$ tells you what happens to each point $\mathbf{p}$ in the domain;
$F_*$ tells you what happens to each tangent vector $\mathbf{v}_p$ at $\mathbf{p}$,
by mapping it to a tangent vector at $F(\mathbf{p})$.


A very simple example might help to explain why the derivative map can give 
information about the distorting properties of a mapping. 


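Example Consider the mapping

\[ F\colon E^2 \to E^2, \qquad F = (x + y, y), \]

and the tangent vectors

\[ \mathbf{v}_p = (1, 0)_p, \qquad \mathbf{w}_p = (0, 1)_p. \]

Working from the definition,

\[
\begin{aligned}
F_*(\mathbf{v}_p) &= (F(\mathbf{p} + t(1, 0)))'(0) \\
&= (F(p_1 + t, p_2))'(0) \\
&= (p_1 + t + p_2, p_2)'(0) \\
&= (1, 0)(0) \\
&= (1, 0) \text{ at } F(\mathbf{p}),
\end{aligned}
\]

\[
\begin{aligned}
F_*(\mathbf{w}_p) &= (F(\mathbf{p} + t(0, 1)))'(0) \\
&= (F(p_1, p_2 + t))'(0) \\
&= (p_1 + p_2 + t, p_2 + t)'(0) \\
&= (1, 1)(0) \\
&= (1, 1) \text{ at } F(\mathbf{p}).
\end{aligned}
\]

Thus, at any point $\mathbf{p}$, the tangent vectors $(1, 0)$ and $(0, 1)$ are transformed to the
tangent vectors $(1, 0)$ and $(1, 1)$ at the image $F(\mathbf{p})$.

This does give some idea of how the mapping distorts the domain.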
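
The calculation ‘from the definition’ can be imitated numerically. The following is a rough sketch under stated assumptions: the helper `derivative_map`, its step size `h` and the sample point are choices made for this illustration. It differentiates $t \mapsto F(\mathbf{p} + t\mathbf{v})$ at $t = 0$ by a central difference, which returns the vector part of $F_*(\mathbf{v}_p)$; the point of application $F(\mathbf{p})$ has to be remembered separately.

```python
def F(x, y):
    """The mapping F = (x + y, y) of the example."""
    return (x + y, y)


def derivative_map(mapping, p, v, h=1e-6):
    """Vector part of F_*(v_p): central-difference estimate of (F(p + t v))'(0)."""
    forward = mapping(*(pi + h * vi for pi, vi in zip(p, v)))
    backward = mapping(*(pi - h * vi for pi, vi in zip(p, v)))
    return tuple((a - b) / (2 * h) for a, b in zip(forward, backward))


p = (3.0, -2.0)  # an arbitrary point; the answers do not depend on it here
image_of_v = derivative_map(F, p, (1.0, 0.0))
image_of_w = derivative_map(F, p, (0.0, 1.0))
print(image_of_v, image_of_w)
```

Up to rounding, the two results are $(1, 0)$ and $(1, 1)$, in agreement with the hand calculation.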


M203 


21 


Theorem 7.5 This result makes the connection between derivative maps and 
directional derivatives quite explicit. Paraphrasing: to find the value of the deriva- 
tive map on vp, directionally differentiate the coordinate functions with respect 


to vp. (This paraphrase leaves out the point of application, but is a useful aid to 
memory nevertheless.) 


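The link with directional derivatives is the basis for a straightforward method of
calculating derivative maps. The method for calculating directional derivatives in
Lemma 3.2,

\[ \mathbf{v}_p[f] = \frac{\partial f}{\partial x}(\mathbf{p})\,v_1 + \frac{\partial f}{\partial y}(\mathbf{p})\,v_2 + \frac{\partial f}{\partial z}(\mathbf{p})\,v_3, \]

can be written as a matrix product

\[
\mathbf{v}_p[f] = \begin{pmatrix} \dfrac{\partial f}{\partial x}(\mathbf{p}) & \dfrac{\partial f}{\partial y}(\mathbf{p}) & \dfrac{\partial f}{\partial z}(\mathbf{p}) \end{pmatrix}
\begin{pmatrix} v_1 \\ v_2 \\ v_3 \end{pmatrix}
\]

or, even,

\[
\mathbf{v}_p[f] = \begin{pmatrix} \dfrac{\partial f}{\partial x} & \dfrac{\partial f}{\partial y} & \dfrac{\partial f}{\partial z} \end{pmatrix}(\mathbf{p})
\begin{pmatrix} v_1 \\ v_2 \\ v_3 \end{pmatrix}.
\]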

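Translating the result of Theorem 7.5 into matrix form, representing tangent vectors
by column matrices with respect to the standard bases, leads directly to the idea
of Jacobian Matrix discussed on page 37. For example, if

\[ F = (f_1, f_2, f_3) \]

is a mapping from $E^3$ to $E^3$, then the image of a tangent vector $\mathbf{v}_p$ under $F_*$ can
be calculated as

\[
\begin{pmatrix}
\dfrac{\partial f_1}{\partial x}(\mathbf{p}) & \dfrac{\partial f_1}{\partial y}(\mathbf{p}) & \dfrac{\partial f_1}{\partial z}(\mathbf{p}) \\
\dfrac{\partial f_2}{\partial x}(\mathbf{p}) & \dfrac{\partial f_2}{\partial y}(\mathbf{p}) & \dfrac{\partial f_2}{\partial z}(\mathbf{p}) \\
\dfrac{\partial f_3}{\partial x}(\mathbf{p}) & \dfrac{\partial f_3}{\partial y}(\mathbf{p}) & \dfrac{\partial f_3}{\partial z}(\mathbf{p})
\end{pmatrix}
\begin{pmatrix} v_1 \\ v_2 \\ v_3 \end{pmatrix}
\]

at $F(\mathbf{p})$.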

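It is usual to shorten

\[
\begin{pmatrix}
\dfrac{\partial f_1}{\partial x}(\mathbf{p}) & \dfrac{\partial f_1}{\partial y}(\mathbf{p}) & \dfrac{\partial f_1}{\partial z}(\mathbf{p}) \\
\dfrac{\partial f_2}{\partial x}(\mathbf{p}) & \dfrac{\partial f_2}{\partial y}(\mathbf{p}) & \dfrac{\partial f_2}{\partial z}(\mathbf{p}) \\
\dfrac{\partial f_3}{\partial x}(\mathbf{p}) & \dfrac{\partial f_3}{\partial y}(\mathbf{p}) & \dfrac{\partial f_3}{\partial z}(\mathbf{p})
\end{pmatrix}
\quad\text{to}\quad
\begin{pmatrix}
\dfrac{\partial f_1}{\partial x} & \dfrac{\partial f_1}{\partial y} & \dfrac{\partial f_1}{\partial z} \\
\dfrac{\partial f_2}{\partial x} & \dfrac{\partial f_2}{\partial y} & \dfrac{\partial f_2}{\partial z} \\
\dfrac{\partial f_3}{\partial x} & \dfrac{\partial f_3}{\partial y} & \dfrac{\partial f_3}{\partial z}
\end{pmatrix}(\mathbf{p}).
\]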


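Strictly, the Jacobian matrix

\[
\begin{pmatrix}
\dfrac{\partial f_1}{\partial x} & \dfrac{\partial f_1}{\partial y} & \dfrac{\partial f_1}{\partial z} \\
\dfrac{\partial f_2}{\partial x} & \dfrac{\partial f_2}{\partial y} & \dfrac{\partial f_2}{\partial z} \\
\dfrac{\partial f_3}{\partial x} & \dfrac{\partial f_3}{\partial y} & \dfrac{\partial f_3}{\partial z}
\end{pmatrix}
\]

represents $F_*$ only with respect to the standard bases in domain and codomain.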


22 


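However, it is an understandable and forgivable abuse to write

\[
F_* = \begin{pmatrix}
\dfrac{\partial f_1}{\partial x} & \dfrac{\partial f_1}{\partial y} & \dfrac{\partial f_1}{\partial z} \\
\dfrac{\partial f_2}{\partial x} & \dfrac{\partial f_2}{\partial y} & \dfrac{\partial f_2}{\partial z} \\
\dfrac{\partial f_3}{\partial x} & \dfrac{\partial f_3}{\partial y} & \dfrac{\partial f_3}{\partial z}
\end{pmatrix}
\]

instead of ‘$F_*$ is represented by ... with respect to ...’.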


Linear approximation (Comment in second paragraph at top of page 37.) This
comment refers to the link between the approximation

\[ f(a + h) \approx f(a) + f'(a) \times h \]

for differentiable functions from $\mathbf{R}$ to $\mathbf{R}$ and the approximation

\[ F(\mathbf{p} + \mathbf{v}) \approx F(\mathbf{p}) + F_*(\mathbf{v}). \]

The parallel is more obvious when the second approximation is written in column
coordinate form:

\[
F(\mathbf{p} + \mathbf{v}) \approx F(\mathbf{p}) + (\text{Jacobian matrix at } \mathbf{p}) \begin{pmatrix} v_1 \\ v_2 \\ v_3 \end{pmatrix}.
\]
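
To see how good this approximation is, it can be tried on a non-linear mapping. The sketch below is illustrative only: the mapping $F = (x^2 - y^2, 2xy)$ on $E^2$, its hand-computed Jacobian matrix, and the sample point and displacements are assumptions chosen for the purpose. The error $\|F(\mathbf{p} + \mathbf{v}) - F(\mathbf{p}) - J\mathbf{v}\|$ is computed for three displacements, each a tenth of the length of the one before.

```python
import math


def F(x, y):
    """An illustrative non-linear mapping from E^2 to E^2."""
    return (x * x - y * y, 2 * x * y)


def jacobian_F(x, y):
    """Jacobian matrix of F, worked out by hand for this particular F."""
    return ((2 * x, -2 * y), (2 * y, 2 * x))


def linear_approximation(p, v):
    """F(p) + (Jacobian matrix at p) applied to the column (v1, v2)."""
    J = jacobian_F(*p)
    Fp = F(*p)
    return tuple(Fp[i] + J[i][0] * v[0] + J[i][1] * v[1] for i in range(2))


def approximation_error(p, v):
    """Distance between F(p + v) and its linear approximation."""
    exact = F(p[0] + v[0], p[1] + v[1])
    approx = linear_approximation(p, v)
    return math.hypot(exact[0] - approx[0], exact[1] - approx[1])


p = (1.0, 2.0)
errors = [approximation_error(p, (0.3 * s, -0.4 * s)) for s in (1.0, 0.1, 0.01)]
print(errors)
```

For this particular mapping the error is exactly the square of the length of $\mathbf{v}$, so shrinking $\mathbf{v}$ by a factor of 10 shrinks the error by a factor of 100; that is the sense in which the approximation is good for small $\mathbf{v}$.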


Theorem 7.8 This result is important not only because it says that velocities of
curves are preserved by $F_*$ but also because it is the analogue of Lemma 4.6. It
says that the straight line

\[ \mathbf{p} + t\mathbf{v} \]

in the definition of $F_*$ can be replaced by any curve through $\mathbf{p}$ with velocity $\mathbf{v}_p$.
Since $F_*$ is obtained by directionally differentiating the coordinate functions, you
should expect results that are true of directional derivatives to be true, in a suitable
sense, of derivative maps.


Definition 7.9 and Theorem 7.10 These are mentioned here for completeness. In 
this course we shall use this definition and result in only one case. When we define 
surfaces, we shall use a two-dimensional version of the definition of curve. We shall 
have mappings from a suitable subset of $E^2$ to $E^3$. Although the surface defined
by such mappings may be a very complicated object when viewed as a whole, we 
shall want any small piece of it to look quite like a piece of the plane. Regularity 
of the mappings used to define surfaces is exactly the condition that we need. 


Checking regularity involves finding the rank of the Jacobian matrix. Now, the rank
of a matrix is the maximum number of linearly independent rows (or columns). In
this course, we are interested in mappings from $E^n$ to $E^m$ in the cases

\[ n, m = 1, 2, 3. \]

If the dimension of the domain is greater than that of the codomain, then the
mapping cannot be regular. For example, if

\[ F\colon E^3 \to E^2, \qquad F = (f_1, f_2), \]

then the Jacobian matrix representing $F_*$ is

\[
\begin{pmatrix}
\dfrac{\partial f_1}{\partial x} & \dfrac{\partial f_1}{\partial y} & \dfrac{\partial f_1}{\partial z} \\
\dfrac{\partial f_2}{\partial x} & \dfrac{\partial f_2}{\partial y} & \dfrac{\partial f_2}{\partial z}
\end{pmatrix}(\mathbf{p}).
\]

This cannot have more than 2 independent rows, so its rank cannot exceed 2, which
is less than the dimension of the domain.


Even more precisely, we
should say that the matrix
represents $F_*$ only with
respect to the natural frame
field in domain and
codomain. We should do so
because $F_*$ transforms
tangent vectors, not vectors.


It is a theorem of linear 
algebra that for any matrix 
the maximum number of 
linearly independent rows is 
the same as the maximum 
number of linearly 
independent columns. 


23 


In practice, the commonest situation in which you will be required to check for
regularity is for mappings

\[ E^2 \to E^3, \]

where the task reduces to showing that the $3 \times 2$ Jacobian matrix has two
independent columns.
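
For a $3 \times 2$ matrix, one convenient test is that two columns in $E^3$ are linearly independent exactly when their cross product is non-zero. The following sketch applies that test; it is one possible approach rather than the method of the text, and both mappings, $(u, v) \mapsto (u, v, u^2 + v^2)$ and $(u, v) \mapsto (u^2, u^3, v)$, are hypothetical examples whose Jacobian columns have been worked out by hand.

```python
def cross(a, b):
    """Cross product of two vectors in E^3."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def jacobian_columns_paraboloid(u, v):
    """Columns of the 3 x 2 Jacobian of the hypothetical mapping (u, v, u^2 + v^2)."""
    return (1.0, 0.0, 2 * u), (0.0, 1.0, 2 * v)


def jacobian_columns_pinched(u, v):
    """Columns of the 3 x 2 Jacobian of the hypothetical mapping (u^2, u^3, v)."""
    return (2 * u, 3 * u * u, 0.0), (0.0, 0.0, 1.0)


def has_rank_two(columns, tol=1e-12):
    """True when the two columns are linearly independent (non-zero cross product)."""
    first, second = columns
    return any(abs(c) > tol for c in cross(first, second))


print(has_rank_two(jacobian_columns_paraboloid(0.0, 0.0)))  # independent columns
print(has_rank_two(jacobian_columns_pinched(0.0, 1.0)))     # first column vanishes
```

The first mapping passes the test at every point, since the cross product of its columns is $(-2u, -2v, 1)$; the second fails wherever $u = 0$. A check at sample points can only suggest regularity, of course; a proof needs the algebra.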


Composite mappings An important result is hidden by O’Neill in two exercises. 
Because of the importance of the result, we ask you to do the two exercises now. 
Our solutions (with comments) follow immediately. 


Try O’Neill, page 40, Exercises 7 and 12(b) now. 
Solution to Exercise 7 In full:

\[
\begin{aligned}
G(F(p_1, p_2)) &= G(f_1(p_1, p_2), f_2(p_1, p_2)) \\
&= (g_1(f_1(p_1, p_2), f_2(p_1, p_2)),\ g_2(f_1(p_1, p_2), f_2(p_1, p_2))).
\end{aligned}
\]

In function form:

\[ GF = (g_1(f_1, f_2),\ g_2(f_1, f_2)). \]

Because $F$ and $G$ are mappings, their coordinate functions are differentiable. Thus,
the coordinate functions of $GF$ involve composites of differentiable functions. Hence,
the coordinate functions of $GF$ are differentiable and $GF$ is a mapping.


Solution to Exercise 12(b) This requires three applications of Theorem 7.8. First,
let

\[ \beta = G(F(\alpha)). \]

Then, by Theorem 7.8 applied to $GF$,

\[ (GF)_*(\alpha') = \beta'. \]

Now, if we define

\[ \gamma = F(\alpha) \]

and apply Theorem 7.8 to $F$, we obtain

\[ F_*(\alpha') = \gamma'. \]

But

\[ \beta = G(\gamma), \]

so

\[ G_*(\gamma') = \beta'. \]

Putting these results together gives

\[
\begin{aligned}
(GF)_*(\alpha') &= \beta' \\
&= G_*(\gamma') \\
&= G_*(F_*(\alpha')).
\end{aligned}
\]

The consequence of this last result is a ‘composite rule’ for derivative maps. Since

\[ (GF)_*(\alpha') = G_*(F_*(\alpha')) \]

holds for all possible curves in the domain, we conclude that

\[ (GF)_* = G_*(F_*). \]


The composite rule as stated above is a little dangerous when using Jacobian
matrices to carry out calculations. If we recall the composite rule for functions from
$\mathbf{R}$ to $\mathbf{R}$ in the form

\[ (g \circ f)' = g'(f) \times f', \]

perhaps you can see why. The derivative $g'$ must be evaluated at the image under
the first function, $f$.


24 


O’Neill writes $GF$ for the
composite, where you may be
used to writing $G \circ F$.


Consider what happens under the various composites that we are discussing. The
mappings transform points:

\[ \mathbf{p} \mapsto F(\mathbf{p}) \mapsto G(F(\mathbf{p})). \]

The derivative maps transform tangent vectors:

\[ \mathbf{v}_p \mapsto F_*(\mathbf{v}_p)_{F(\mathbf{p})} \mapsto (GF)_*(\mathbf{v}_p)_{G(F(\mathbf{p}))}. \]

It should be clear that $F_*$ can be achieved by applying the value of the Jacobian
matrix for $F$ at $\mathbf{p}$ but that $G_*$ requires the value of the Jacobian matrix for $G$ at
$F(\mathbf{p})$.


This last statement suggests that a ‘safer’ statement of the composite rule for
mappings is

\[ (G \circ F)_* = G_*(F)\,F_*, \]

where $G_*(F)$ means $G_*$ evaluated at $F$.


This alternative form of the composite rule for maps has the advantage of being 
very similar to the one for ordinary functions. 
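
The danger mentioned above shows up clearly in a numerical experiment. The sketch below is an illustration under assumptions: $F = (2xy, x^2 - y^2)$ is borrowed from Exercise 7.2, while $G = (x + y^2, xy)$, the helper names and the sample point are hypothetical choices made here. The product of Jacobian matrices with $G$'s matrix evaluated at $F(\mathbf{p})$ is compared with a finite-difference Jacobian of the composite, and with the incorrect product in which $G$'s matrix is evaluated at $\mathbf{p}$.

```python
def F(x, y):
    return (2 * x * y, x * x - y * y)


def G(x, y):
    return (x + y * y, x * y)


def jacobian_F(x, y):
    """Jacobian matrix of F, computed by hand."""
    return [[2 * y, 2 * x], [2 * x, -2 * y]]


def jacobian_G(x, y):
    """Jacobian matrix of G, computed by hand."""
    return [[1.0, 2 * y], [y, x]]


def matmul(A, B):
    """Product of two matrices stored as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]


def numerical_jacobian(mapping, p, h=1e-6):
    """Central-difference Jacobian; entry [i][j] estimates d(mapping_i)/d(x_j)."""
    columns = []
    for j in range(len(p)):
        plus, minus = list(p), list(p)
        plus[j] += h
        minus[j] -= h
        columns.append([(a - b) / (2 * h)
                        for a, b in zip(mapping(*plus), mapping(*minus))])
    return [[columns[j][i] for j in range(len(p))] for i in range(len(columns[0]))]


p = (1.0, 2.0)
by_rule = matmul(jacobian_G(*F(*p)), jacobian_F(*p))   # G's matrix at F(p): correct
wrong = matmul(jacobian_G(*p), jacobian_F(*p))         # G's matrix at p: incorrect
numeric = numerical_jacobian(lambda x, y: G(*F(x, y)), p)
print(by_rule, numeric, wrong)
```

Only the first product agrees with the finite-difference estimate, which is the point of writing the rule as $(G \circ F)_* = G_*(F)\,F_*$.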


The set of exercises for this section includes practice at finding derived maps of 
composites using Jacobians. 
Exercise 7.1 Express each of the following mappings from $E^3$ to $E^3$ in terms of
the Euclidean coordinate functions.
(a) F : p — —3p (b) F : p (eP!P?, ps + 2po, p?) 
Exercise 7.2 The mapping $F\colon E^2 \to E^2$ is defined by

\[ F = (2xy, x^2 - y^2). \]

Find the images under $F$ of the following lines, and identify the image curves.
(a)@=1 $) y=- 
Exercise 7.3 Let $F\colon E^3 \to E^3$ be the mapping

\[ F = (x - y, x + y, 2z), \]

from O’Neill, Example 7.3.


(a) Find, from first principles, the derivative map $F_*$.
(b) Show that $F$ is linear, in the sense that

\[ F(a\mathbf{p} + b\mathbf{q}) = aF(\mathbf{p}) + bF(\mathbf{q}), \quad \text{for all } \mathbf{p}, \mathbf{q} \in E^3,\ a, b \in \mathbf{R}. \]

(c) Generalize the result for $F$ to show that if $G$ is any linear transformation
(mapping) from $E^3$ to $E^3$, then

\[ G_*(\mathbf{v}_p) = G(\mathbf{v})_{G(\mathbf{p})}. \]


Exercise 7.4 Let $F$ be a mapping from $E^3$ to $E^3$ and let $G$ be a mapping from
$E^3$ to $E^2$ defined by

F=(e+y,2-2y,2+y+z) and G=(2?+y?,y? +27). 
(a) Write down the Jacobian matrix for $F_*$.


(b) Write down the Jacobian matrix for $G_*$ and evaluate it at $F(x, y, z)$.
(c) Use the composite rule to find the Jacobian matrix for $(G \circ F)_*$.


[Solutions on page 35] 


The similarity may make it 
more memorable. 


25 


8 Summary 


Read O'Neill: Chapter I, Section 8, page 41.


The first chapter of O’Neill has not provided all the tools needed for our study of 
curves. We shall need a little more discussion of frame fields and some work on 
reparametrization. The first two sections of Part II will provide what is necessary. 


There is one ‘spin-off’ from the work on derivative maps that we have done. We 
can provide a general version of the chain rule. Suppose that we extend the work 
done on Exercise 7, page 40 of O'Neill. 


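We have mappings

\[ F = (f_1, f_2), \qquad G = (g_1, g_2) \]

from $E^2$ to $E^2$. The corresponding Jacobians are

\[
\begin{pmatrix}
\dfrac{\partial f_1}{\partial x} & \dfrac{\partial f_1}{\partial y} \\
\dfrac{\partial f_2}{\partial x} & \dfrac{\partial f_2}{\partial y}
\end{pmatrix}
\quad\text{and}\quad
\begin{pmatrix}
\dfrac{\partial g_1}{\partial x} & \dfrac{\partial g_1}{\partial y} \\
\dfrac{\partial g_2}{\partial x} & \dfrac{\partial g_2}{\partial y}
\end{pmatrix}.
\]

The composite mapping $G \circ F$ has Jacobian

\[
\begin{pmatrix}
\dfrac{\partial g_1(f_1, f_2)}{\partial x} & \dfrac{\partial g_1(f_1, f_2)}{\partial y} \\
\dfrac{\partial g_2(f_1, f_2)}{\partial x} & \dfrac{\partial g_2(f_1, f_2)}{\partial y}
\end{pmatrix}.
\]

Writing out the composite rule for derived maps gives

\[
\begin{pmatrix}
\dfrac{\partial g_1(f_1, f_2)}{\partial x} & \dfrac{\partial g_1(f_1, f_2)}{\partial y} \\
\dfrac{\partial g_2(f_1, f_2)}{\partial x} & \dfrac{\partial g_2(f_1, f_2)}{\partial y}
\end{pmatrix}
=
\begin{pmatrix}
\dfrac{\partial g_1}{\partial x} & \dfrac{\partial g_1}{\partial y} \\
\dfrac{\partial g_2}{\partial x} & \dfrac{\partial g_2}{\partial y}
\end{pmatrix}(F)
\begin{pmatrix}
\dfrac{\partial f_1}{\partial x} & \dfrac{\partial f_1}{\partial y} \\
\dfrac{\partial f_2}{\partial x} & \dfrac{\partial f_2}{\partial y}
\end{pmatrix}.
\]

Taking the top left element of both sides gives

\[
\frac{\partial g_1(f_1, f_2)}{\partial x}
= \frac{\partial g_1}{\partial x}(f_1, f_2)\,\frac{\partial f_1}{\partial x}
+ \frac{\partial g_1}{\partial y}(f_1, f_2)\,\frac{\partial f_2}{\partial x}.
\]

There are three other, similar results.


A similar approach for a function of three variables would give

\[
\frac{\partial g(f_1, f_2, f_3)}{\partial x}
= \frac{\partial g}{\partial x}(f_1, f_2, f_3)\,\frac{\partial f_1}{\partial x}
+ \frac{\partial g}{\partial y}(f_1, f_2, f_3)\,\frac{\partial f_2}{\partial x}
+ \frac{\partial g}{\partial z}(f_1, f_2, f_3)\,\frac{\partial f_3}{\partial x}.
\]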


You might like to show that the composite rule for derived maps reduces to the 
ordinary chain rule in the case of mappings (functions) from R to R. (Hint: A 
1 x 1 Jacobian matrix is just like a function.) 


26 


Solutions to the exercises 


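Solution 1.1
(a) We have

\[ fg^2 = (x^2 y)(y \sin z)^2 = x^2 y^3 \sin^2 z. \]

(b) Here

\[ \frac{\partial f}{\partial x} = 2xy, \qquad \frac{\partial g}{\partial y} = \sin z, \]

\[
\frac{\partial f}{\partial x}\,g + \frac{\partial g}{\partial y}\,f = (2xy)(y \sin z) + (\sin z)(x^2 y) = xy(2y + x)\sin z.
\]

(c) Note that, by definition,

\[ \frac{\partial^2 (fg)}{\partial y\,\partial z} = \frac{\partial}{\partial y}\left(\frac{\partial (fg)}{\partial z}\right). \]

We differentiate first with respect to $z$, then with respect
to $y$. Although it is perfectly easy to find $fg$ and work on
that, we shall instead use the product rule:

\[
\frac{\partial (fg)}{\partial z} = \frac{\partial f}{\partial z}\,g + f\,\frac{\partial g}{\partial z} = 0 + (x^2 y)(y \cos z) = x^2 y^2 \cos z,
\]

\[
\frac{\partial^2 (fg)}{\partial y\,\partial z} = \frac{\partial (x^2 y^2 \cos z)}{\partial y} = 2x^2 y \cos z.
\]

(d) This provides an example suitable for using the chain
(composite) rule:

\[
\frac{\partial \sin(f)}{\partial y} = \frac{d(\sin(f))}{df}\,\frac{\partial f}{\partial y} = \cos(f)\,x^2 = x^2 \cos(x^2 y).
\]

Note that we could have written the start of this solution
as

\[ \sin'(f)\,\frac{\partial f}{\partial y}. \]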


Solution 1.2 


(a) This solution is full to the point of tediousness just
to illustrate what is meant by the pointwise principle
(abbreviated here to pp).

\[
\begin{aligned}
f(1, 1, 1) &= (x^2 y - y^2 z)(1, 1, 1) \\
&= (x^2 y)(1, 1, 1) - (y^2 z)(1, 1, 1) && \text{(pp for subtraction)} \\
&= (x^2)(1, 1, 1)\,y(1, 1, 1) - (y^2)(1, 1, 1)\,z(1, 1, 1) && \text{(pp for multiplication)} \\
&= x(1, 1, 1)\,x(1, 1, 1)\,y(1, 1, 1) - y(1, 1, 1)\,y(1, 1, 1)\,z(1, 1, 1) && \text{(pp for multiplication)} \\
&= 1 \times 1 \times 1 - 1 \times 1 \times 1 = 0.
\end{aligned}
\]

In practice, you probably did the calculation with a
fraction of the effort expended here.


(b) This time we give a shorter solution!

\[
\begin{aligned}
f(3, -1, 1/2) &= (x^2 y - y^2 z)(3, -1, 1/2) \\
&= 3^2 \times (-1) - (-1)^2 \times (1/2) \\
&= -9 - 1/2 = -19/2.
\end{aligned}
\]

(c) Here

\[ f(a, 1, 1 - a) = a^2 \times 1 - 1^2 \times (1 - a) = a^2 + a - 1. \]

(d) This time

\[ f(t, t^2, t^3) = t^2 \times t^2 - (t^2)^2 \times t^3 = t^4 - t^7. \]


Solution 1.3 


These solutions use the linearity properties, the Leibniz 
property and the chain rule as appropriate. 


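(a) Here

\[
\begin{aligned}
\frac{\partial f}{\partial x} &= \frac{\partial x}{\partial x}\,\sin(xy) + x\,\frac{\partial \sin(xy)}{\partial x} + y\,\frac{\partial \cos(xz)}{\partial x} \\
&= \sin(xy) + xy\cos(xy) - yz\sin(xz).
\end{aligned}
\]

(b) In this case

\[
\begin{aligned}
\frac{\partial f}{\partial x} &= \frac{d(\sin g)}{dg}\,\frac{\partial g}{\partial x} && \text{(chain rule)} \\
&= \cos g\,\frac{d(e^h)}{dh}\,\frac{\partial h}{\partial x} && \text{(chain rule)} \\
&= \cos g \; e^h \; 2x \\
&= 2x\,e^{x^2 + y^2 + z^2} \cos\left(e^{x^2 + y^2 + z^2}\right).
\end{aligned}
\]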


Solution 1.4 


What would be extremely useful for this question is a 
generalized chain rule. However, for now, we calculate the 
composite function in each case and differentiate directly. 


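(a) Here

\[ f = (x + y)^2 - (y^2)(x + z), \]

\[ \frac{\partial f}{\partial x} = 2(x + y)(1 + 0) - y^2 = 2(x + y) - y^2. \]

(b) This time

\[
\begin{aligned}
f &= (e^x)^2 - e^{x + y}\,e^x \\
&= e^{2x} - e^{2x}e^y \\
&= e^{2x}(1 - e^y),
\end{aligned}
\]

so

\[ \frac{\partial f}{\partial x} = 2e^{2x}(1 - e^y). \]

(c) Now

\[ f = x^2 - (-x)(x) = 2x^2, \]

so

\[ \frac{\partial f}{\partial x} = 4x. \]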
Solution 2.1 


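By definition, the vector part of $3\mathbf{v}_p - 2\mathbf{w}_p$ is

\[ 3\mathbf{v} - 2\mathbf{w} = 3(-2, 1, -1) - 2(0, 1, 3) = (-6, 1, -9). \]

Hence

\[ 3\mathbf{v}_p - 2\mathbf{w}_p = (-6, 1, -9)_p. \]

This result could also be written

\[ 3\mathbf{v}_p - 2\mathbf{w}_p = -6U_1(\mathbf{p}) + U_2(\mathbf{p}) - 9U_3(\mathbf{p}). \]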


27 


Solution 2.2

First, we find $W - xV$:

\[
\begin{aligned}
W - xV &= (2x^2 U_2 - U_3) - x(xU_1 + yU_2) \\
&= 2x^2 U_2 - U_3 - x^2 U_1 - xyU_2 \\
&= -x^2 U_1 + (2x^2 - xy)U_2 - U_3.
\end{aligned}
\]

Now, the value at $\mathbf{p} = (-1, 0, 2)$ of this vector field is

\[
\begin{aligned}
&(-x^2 U_1 + (2x^2 - xy)U_2 - U_3)(-1, 0, 2) \\
&\quad= -(-1)^2 U_1(-1, 0, 2) + (2(-1)^2 - (-1)(0))U_2(-1, 0, 2) - U_3(-1, 0, 2) \\
&\quad= -U_1(-1, 0, 2) + 2U_2(-1, 0, 2) - U_3(-1, 0, 2) \\
&\quad= -(1, 0, 0)_{(-1, 0, 2)} + 2(0, 1, 0)_{(-1, 0, 2)} - (0, 0, 1)_{(-1, 0, 2)} \\
&\quad= (-1, 2, -1)_{(-1, 0, 2)}.
\end{aligned}
\]

Solution 2.3

(a) Since

\[ 7V = 2z^2 U_1 - xyU_3, \]

we have

\[ V = \frac{2z^2}{7}U_1 + 0U_2 - \frac{xy}{7}U_3. \]

You may, quite reasonably, have omitted the middle term
since it is zero.

(b) We can rewrite the information given as

\[ V(\mathbf{p}) = p_1 U_1(\mathbf{p}) + (p_3 - p_1)U_2(\mathbf{p}) + 0U_3(\mathbf{p}). \]

If we also use $x(\mathbf{p}) = p_1$, etc., we obtain

\[ V(\mathbf{p}) = x(\mathbf{p})U_1(\mathbf{p}) + (z(\mathbf{p}) - x(\mathbf{p}))U_2(\mathbf{p}) + 0U_3(\mathbf{p}) \]

which, by the pointwise principle, defines $V$ to be

\[ V = xU_1 + (z - x)U_2 + 0U_3. \]

(c) We have $V = xU_1 + 2yU_2 + xy^2 U_3$.

(d) First, we need the vector part of $V(\mathbf{p})$, which is

\[ (1 + p_1, p_2 p_3, p_2) - (p_1, p_2, p_3) = (1, p_2 p_3 - p_2, p_2 - p_3). \]

By an analysis like that in the first part of this question,
we obtain

\[ V = U_1 + (yz - y)U_2 + (y - z)U_3. \]

(e) Here the vector part is $(-p_1, -p_2, -p_3)$, so

\[ V(\mathbf{p}) = (-p_1, -p_2, -p_3)_p \]

and so

\[ V = -xU_1 - yU_2 - zU_3. \]

Solution 2.4

(a) By the definition of independence of tangent vectors
it is enough to show that the vector parts are linearly
independent. The vector parts are

\[ (1, 0, -p_1), \quad (0, 1, 0) \quad\text{and}\quad (p_1, 0, 1). \]

We apply the standard test for linear independence.
Suppose that

\[ a(1, 0, -p_1) + b(0, 1, 0) + c(p_1, 0, 1) = 0, \quad a, b, c \in \mathbf{R}. \]

Comparing components gives

\[
\begin{aligned}
a + cp_1 &= 0, \\
b &= 0, \\
-ap_1 + c &= 0.
\end{aligned}
\]

We have $b = 0$ immediately. The third equation gives
$c = ap_1$, and substituting in the first gives

\[ a + ap_1^2 = a(1 + p_1^2) = 0. \]

Since $1 + p_1^2 > 0$, we have $a = 0$ and hence $c = 0$. Thus

\[ a = b = c = 0, \]

and the three vectors are linearly independent.

(b) We want

\[
\begin{aligned}
xU_1 + yU_2 + zU_3 &= fV_1 + gV_2 + hV_3 \\
&= f(U_1 - xU_3) + gU_2 + h(xU_1 + U_3),
\end{aligned}
\]

where $f$, $g$ and $h$ are functions from $E^3$ to $\mathbf{R}$. Comparing
the coefficients of the $U_i$ gives

\[
\begin{aligned}
x &= f + hx, \\
y &= g, \\
z &= -fx + h.
\end{aligned}
\]

We have

\[ g = y, \]

and substituting $f = x - hx$ from the first equation into
the third gives

\[ z = -(x - hx)x + h = h(1 + x^2) - x^2. \]

Hence

\[ h = \frac{x^2 + z}{1 + x^2} \]

and

\[
f = x - \frac{x(x^2 + z)}{1 + x^2} = x\left(1 - \frac{x^2 + z}{1 + x^2}\right) = \frac{x(1 - z)}{1 + x^2}.
\]

Thus

\[ xU_1 + yU_2 + zU_3 = \frac{x(1 - z)}{1 + x^2}V_1 + yV_2 + \frac{x^2 + z}{1 + x^2}V_3. \]

Solution 3.1

To apply the definition we shall need $\mathbf{p} + t\mathbf{v}$:

\[ \mathbf{p} + t\mathbf{v} = (2 + 2t, -t, -1 + 3t). \]

(a) Using the above:

\[
\begin{aligned}
\mathbf{v}_p[f] &= ((y^2 z)(2 + 2t, -t, -1 + 3t))'(0) \\
&= ((-t)^2(-1 + 3t))'(0) \\
&= (2t(-1 + 3t) + t^2(3))(0) && \text{(product rule)} \\
&= 0 + 0 = 0.
\end{aligned}
\]

(b) Working as in the last part,

\[
\begin{aligned}
\mathbf{v}_p[f] &= ((2 + 2t)^7)'(0) \\
&= (7(2 + 2t)^6 \times 2)(0) \\
&= 7 \times 2^6 \times 2 \\
&= 896.
\end{aligned}
\]

(c) This time

\[
\begin{aligned}
\mathbf{v}_p[f] &= (e^{2 + 2t}\cos(-t))'(0) \\
&= (2e^{2 + 2t}\cos(-t) + e^{2 + 2t}(-\sin(-t))(-1))(0) \\
&= 2e^2 \cos 0 + e^2 \sin 0 \\
&= 2e^2.
\end{aligned}
\]

28 


Solution 3.2 


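For each part, we calculate the three partial derivatives,
evaluate at $\mathbf{p}$ and then apply the formula from
Lemma 3.2.

(a)
\[
\begin{aligned}
\frac{\partial f}{\partial x} &= 0, & \frac{\partial f}{\partial x}(\mathbf{p}) &= 0, \\
\frac{\partial f}{\partial y} &= 2yz, & \frac{\partial f}{\partial y}(\mathbf{p}) &= 2(0)(-1) = 0, \\
\frac{\partial f}{\partial z} &= y^2, & \frac{\partial f}{\partial z}(\mathbf{p}) &= 0^2 = 0,
\end{aligned}
\]
\[ \mathbf{v}_p[f] = 0 \times 2 + 0 \times (-1) + 0 \times 3 = 0. \]

(b)
\[
\begin{aligned}
\frac{\partial f}{\partial x} &= 7x^6, & \frac{\partial f}{\partial x}(\mathbf{p}) &= 7 \times 2^6 = 448, \\
\frac{\partial f}{\partial y} &= 0, & \frac{\partial f}{\partial y}(\mathbf{p}) &= 0, \\
\frac{\partial f}{\partial z} &= 0, & \frac{\partial f}{\partial z}(\mathbf{p}) &= 0,
\end{aligned}
\]
\[ \mathbf{v}_p[f] = 448 \times 2 + 0 \times (-1) + 0 \times 3 = 896. \]

(c)
\[
\begin{aligned}
\frac{\partial f}{\partial x} &= e^x \cos y, & \frac{\partial f}{\partial x}(\mathbf{p}) &= e^2 \cos 0 = e^2, \\
\frac{\partial f}{\partial y} &= -e^x \sin y, & \frac{\partial f}{\partial y}(\mathbf{p}) &= -e^2 \sin 0 = 0, \\
\frac{\partial f}{\partial z} &= 0, & \frac{\partial f}{\partial z}(\mathbf{p}) &= 0,
\end{aligned}
\]
\[ \mathbf{v}_p[f] = e^2(2) + 0 + 0 = 2e^2. \]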


Solution 3.3 


The main technique used here is $U_i[f] = \partial f/\partial x_i$. We also
use linearity but, as usual, this is more or less taken for
granted.

\[
\begin{aligned}
V[f] &= (2U_1 - U_2 + 3U_3)[f] \\
&= 2U_1[f] - U_2[f] + 3U_3[f] && \text{(linearity)} \\
&= 2e^x \cos y - e^x(-\sin y) + 3(0) \\
&= e^x(2\cos y + \sin y).
\end{aligned}
\]


Solution 3.4 


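These questions are all tackled by applying the various
parts of Corollary 3.4 (pages 13-14) and $U_i[f] = \partial f/\partial x_i$.

(a) Here

\[
\begin{aligned}
V[f] &= (y^2 U_1 - xU_3)[f] \\
&= y^2 U_1[f] - xU_3[f] && \text{(Corollary 3.4(1))} \\
&= y^2\,\frac{\partial f}{\partial x} - x\,\frac{\partial f}{\partial z} \\
&= y^2(y) - x(0) \\
&= y^3.
\end{aligned}
\]

(b) This time,

\[
\begin{aligned}
V[g] &= y^2 U_1[g] - xU_3[g] \\
&= y^2\,\frac{\partial g}{\partial x} - x\,\frac{\partial g}{\partial z} \\
&= y^2(0) - x(3z^2) \\
&= -3xz^2.
\end{aligned}
\]

(c) Now

\[
\begin{aligned}
V[fg] &= V[f]\,g + f\,V[g] && \text{(Corollary 3.4(3))} \\
&= y^3 g + f(-3xz^2) && \text{(using parts (a) and (b))} \\
&= y^3(z^3) + xy(-3xz^2) \\
&= yz^2(y^2 z - 3x^2).
\end{aligned}
\]

(d) In this case,

\[
\begin{aligned}
f\,V[g] - g\,V[f] &= xy(-3xz^2) - z^3(y^3) \\
&= -yz^2(3x^2 + y^2 z).
\end{aligned}
\]

(e) We could do this question very simply by calculating
$f^2 + g^2$.
However, there is some merit in seeing how Corollary 3.4
gives an extension to a familiar result from ordinary
calculus. First, note

\[
\begin{aligned}
V[f^2] &= V[ff] \\
&= V[f]\,f + f\,V[f] \\
&= 2f\,V[f].
\end{aligned}
\]

This result may remind you of applying the (ordinary)
chain rule to obtain

\[ (f^2)' = 2ff'. \]

Applying the result above and a corresponding one for
$V[g^2]$,

\[
\begin{aligned}
V[f^2 + g^2] &= V[f^2] + V[g^2] && \text{(linearity)} \\
&= 2f\,V[f] + 2g\,V[g] \\
&= 2(xy)(y^3) + 2(z^3)(-3xz^2) \\
&= 2x(y^4 - 3z^5).
\end{aligned}
\]

(f) Now, we have

\[
\begin{aligned}
V[V[f]] &= V[y^3] \\
&= (y^2 U_1 - xU_3)[y^3] \\
&= y^2 U_1[y^3] - xU_3[y^3] \\
&= y^2\,\frac{\partial y^3}{\partial x} - x\,\frac{\partial y^3}{\partial z} \\
&= y^2(0) - x(0) \\
&= 0.
\end{aligned}
\]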


Solution 3.5 

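The question directs us to calculate the three derivatives

\[ V[x_i] = v_1 U_1[x_i] + v_2 U_2[x_i] + v_3 U_3[x_i], \quad i = 1, 2, 3. \]

To do this, note that

\[ U_j[x_i] = \frac{\partial x_i}{\partial x_j} = \delta_{ij}. \]

That is,

\[ \frac{\partial x_i}{\partial x_i} = 1, \qquad \frac{\partial x_i}{\partial x_j} = 0 \quad (i \neq j). \]

Hence

\[ V[x_i] = v_1 U_1[x_i] + v_2 U_2[x_i] + v_3 U_3[x_i] = v_i, \quad i = 1, 2, 3. \]

This is because only the term whose suffix is $i$ is
non-zero. Hence, by the definition of $V$,

\[
\begin{aligned}
V &= v_1 U_1 + v_2 U_2 + v_3 U_3 \\
&= V[x_1]U_1 + V[x_2]U_2 + V[x_3]U_3 \\
&= \sum_{i=1}^{3} V[x_i]U_i.
\end{aligned}
\]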


Solution 3.6 


The method of solving this question illustrates a good 
general point. When some property is given to hold for 
all of some class of objects, it is always worth applying 
the information to some ‘easy’ particular cases. Here we 
are told that $V[f] = W[f]$ for all real-valued functions $f$
on $E^3$. We may obtain useful information by applying

29 


this to the coordinate functions $x_i$, $i = 1, 2, 3$. We shall
use the standard convention that

\[
\begin{aligned}
V &= v_1 U_1 + v_2 U_2 + v_3 U_3, \\
W &= w_1 U_1 + w_2 U_2 + w_3 U_3.
\end{aligned}
\]

By what we know,

\[ V[x_i] = W[x_i], \quad i = 1, 2, 3, \]

but we also know that

\[ V[x_i] = v_i, \qquad W[x_i] = w_i, \]

so

\[ v_i = w_i, \quad i = 1, 2, 3. \]

This means that $V = W$.


Solution 4.1 


(a) We obtain the velocity by differentiating each 
coordinate function: 


a'(t) = (—4 cos tsin t, 2 cos 2t, 2cost)at), O<t< z 
Note that the velocity vector has a point of application 


since it is a tangent vector (as well as being a vector 
tangent to the curve). 


Evaluating at 7/4, we use 


in” Scot = 

sin 7 = co T 
cos 3 =0, snz = 1. 
Thus 


a’ () = (-2,0, V2) 3,2): 
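(b) (i) The length of $\alpha(t)$ is given by
\[
\begin{aligned}
\|\alpha(t)\| &= \sqrt{4\cos^4 t + \sin^2 2t + 4\sin^2 t} \\
&= \sqrt{4\cos^4 t + 4\sin^2 t\cos^2 t + 4\sin^2 t} \quad (\sin 2t = 2\sin t\cos t) \\
&= \sqrt{4\cos^2 t(\cos^2 t + \sin^2 t) + 4\sin^2 t} \\
&= \sqrt{4\cos^2 t + 4\sin^2 t} \\
&= \sqrt{4} = 2.
\end{aligned}
\]

(ii) By similar methods,
\[
\begin{aligned}
\|\alpha(t) - (1, 0, 2\sin t)\| &= \sqrt{(2\cos^2 t - 1)^2 + \sin^2 2t + 0^2} \\
&= \sqrt{\cos^2 2t + \sin^2 2t} \quad (\cos 2t = 2\cos^2 t - 1) \\
&= \sqrt{1} = 1.
\end{aligned}
\]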
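(c) Given that
\[ t = h(s) = \arcsin s, \]
that is,
\[ s = \sin t, \]
we shall need $\cos t$ in terms of $s$ as well:
\[ \cos t = \sqrt{1 - \sin^2 t} = \sqrt{1 - s^2}. \]
Hence, using
\[ \sin 2t = 2\sin t\cos t, \]
we have
\[
\begin{aligned}
\beta(s) &= \alpha(h(s)) \\
&= \left(2\left(\sqrt{1 - s^2}\right)^2,\ 2s\sqrt{1 - s^2},\ 2s\right) \\
&= \left(2(1 - s^2),\ 2s\sqrt{1 - s^2},\ 2s\right).
\end{aligned}
\]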
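
Parts (b) and (c) lend themselves to a quick numerical check. The sketch below assumes the curve is alpha(t) = (2cos^2 t, sin 2t, 2 sin t), which is the curve consistent with the velocity and length calculations in this solution. The function names are illustrative only.

```python
import math

def alpha(t):
    """The curve alpha(t) = (2cos^2 t, sin 2t, 2 sin t)."""
    return (2 * math.cos(t)**2, math.sin(2 * t), 2 * math.sin(t))

def beta(s):
    """The reparametrization beta(s) = alpha(arcsin s), in closed form."""
    root = math.sqrt(1 - s**2)
    return (2 * (1 - s**2), 2 * s * root, 2 * s)

def norm(v):
    """Euclidean length of a vector given as a tuple."""
    return math.sqrt(sum(c * c for c in v))

checks = []
for t in (0.2, 0.7, 1.3):                      # sample values in (0, pi/2)
    a = alpha(t)
    sphere = norm(a)                           # part (b)(i): expect 2
    cylinder = norm((a[0] - 1, a[1], 0.0))     # part (b)(ii): expect 1
    gap = max(abs(p - q) for p, q in zip(a, beta(math.sin(t))))  # part (c)
    checks.append((sphere, cylinder, gap))
    print(sphere, cylinder, gap)
```

Each printed row should be close to 2, 1 and 0 respectively.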




Solution 4.2 


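If we assume that the coordinate functions of $\alpha$ are $\alpha_1$,
$\alpha_2$ and $\alpha_3$, then we know that
\[
\begin{aligned}
\alpha_1(0) &= 1, & \alpha_1'(t) &= t^2, \\
\alpha_2(0) &= 0, & \alpha_2'(t) &= t, \\
\alpha_3(0) &= -5, & \alpha_3'(t) &= e^t.
\end{aligned}
\]

Integrating, and using the initial values, gives
\[
\begin{aligned}
\alpha_1(t) &= \frac{t^3}{3} + 1, \\
\alpha_2(t) &= \frac{t^2}{2}, \\
\alpha_3(t) &= e^t - 6
\end{aligned}
\]
(the constant in $\alpha_3$ is $-6$ because $\alpha_3(0) = e^0 + c = -5$), so
\[ \alpha(t) = \left(\frac{t^3}{3} + 1,\ \frac{t^2}{2},\ e^t - 6\right). \]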
Solution 4.3 
We begin by expressing the equation of each line in
$p + tv$ form. For the first line,
\[
\begin{aligned}
p &= (1, -3, -1), \\
v &= (6, 2, 1) - (1, -3, -1) = (5, 5, 2).
\end{aligned}
\]
Hence the equation is
\[
\begin{aligned}
\alpha(t) &= (1, -3, -1) + t(5, 5, 2) \\
&= (1 + 5t,\ -3 + 5t,\ -1 + 2t).
\end{aligned}
\]

For the second line,
\[
\begin{aligned}
p &= (-1, 1, 0), \\
v &= (-5, -1, -1) - (-1, 1, 0) = (-4, -2, -1).
\end{aligned}
\]
This gives
\[ \beta(s) = (-1 - 4s,\ 1 - 2s,\ -s). \]
In order to discover if these lines meet, we must try to
solve
\[ (1 + 5t,\ -3 + 5t,\ -1 + 2t) = (-1 - 4s,\ 1 - 2s,\ -s). \]
Comparing the third components shows that, if there is a
solution, we must have $s = 1 - 2t$. The first component
now gives
\[ 1 + 5t = -1 - 4s = -1 - 4(1 - 2t) = 8t - 5, \]
so
\[ 6 = 3t \]
or
\[ t = 2. \]
Hence
\[ s = 1 - 4 = -3. \]

Checking the second components:
\[ -3 + 5t = -3 + 5(2) = 7, \qquad 1 - 2s = 1 - (-6) = 7. \]

Thus, taking $t = 2$, $s = -3$ shows that the point $(11, 7, 3)$
lies on both lines.
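
The intersection can be confirmed by evaluating both parametrizations directly. This is a minimal sketch using the four points from the working above; the helper `line` is a hypothetical name introduced here for illustration.

```python
def line(p, q):
    """Return the map t -> p + t(q - p) for the straight line through p and q."""
    v = tuple(b - a for a, b in zip(p, q))
    return lambda t: tuple(a + t * c for a, c in zip(p, v))

alpha = line((1, -3, -1), (6, 2, 1))      # first line, direction (5, 5, 2)
beta = line((-1, 1, 0), (-5, -1, -1))     # second line, direction (-4, -2, -1)

print(alpha(2))    # (11, 7, 3)
print(beta(-3))    # (11, 7, 3)
```

Both calls return the same point, in agreement with the values t = 2 and s = -3 found above.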


Solution 4.4 

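(a) The three velocity vectors are
\[
(1, 2t, 1)_{(t,\,1+t^2,\,t)}, \qquad
(\cos t, -\sin t, 1)_{(\sin t,\,\cos t,\,t)} \qquad \text{and} \qquad
(\cosh t, \sinh t, 1)_{(\sinh t,\,\cosh t,\,t)}.
\]

It follows that the velocity vectors at $t = 0$ are all
\[ v_p = (1, 0, 1)_{(0, 1, 0)}. \]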
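(b) We apply the corollary to Lemma 4.6 that was
discussed earlier in this section. For each curve, we
evaluate $f$ on the curve, differentiate and then substitute
$t = 0$:
\[
\begin{aligned}
f(t, 1 + t^2, t) &= t^2 - (1 + t^2)^2 + t^2, \\
\text{Derivative} &= 2t - 2(1 + t^2)(2t) + 2t, \\
\text{At } t = 0, \text{ value} &= 0 - 0 + 0.
\end{aligned}
\]
Hence $v_p[f] = 0$.
Similarly with the second curve:
\[
\begin{aligned}
f(\sin t, \cos t, t) &= \sin^2 t - \cos^2 t + t^2, \\
\text{Derivative} &= 2\sin t\cos t + 2\cos t\sin t + 2t, \\
\text{At } t = 0, \text{ value} &= 0 + 0 + 0 = 0.
\end{aligned}
\]
For the third:
\[
\begin{aligned}
f(\sinh t, \cosh t, t) &= \sinh^2 t - \cosh^2 t + t^2, \\
\text{Derivative} &= 2\sinh t\cosh t - 2\cosh t\sinh t + 2t, \\
\text{At } t = 0, \text{ value} &= 0 + 0 + 0 = 0.
\end{aligned}
\]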
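
The same three derivatives can be estimated numerically. The sketch below assumes f = x^2 - y^2 + z^2 and the three curves (t, 1 + t^2, t), (sin t, cos t, t) and (sinh t, cosh t, t), as they appear in the working above. The derivative at t = 0 is approximated by a central difference, so the results are approximate by construction.

```python
import math

def f(x, y, z):
    return x**2 - y**2 + z**2

curves = {
    "alpha": lambda t: (t, 1 + t**2, t),
    "beta": lambda t: (math.sin(t), math.cos(t), t),
    "gamma": lambda t: (math.sinh(t), math.cosh(t), t),
}

def derivative_at_zero(curve, eps=1e-6):
    """Central-difference estimate of d/dt f(curve(t)) at t = 0."""
    return (f(*curve(eps)) - f(*curve(-eps))) / (2 * eps)

results = {name: derivative_at_zero(c) for name, c in curves.items()}
print(results)   # each value should be close to 0
```

All three estimates vanish because the curves share the initial velocity found in part (a), and the directional derivative depends only on that velocity.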


Solution 4.5 


Following the definition given in the question, we first 
find the velocity vector: 

\[ \alpha'(t) = (-2\sin t,\ 2\cos t,\ 1)_{(2\cos t,\ 2\sin t,\ t)}. \]
Now we write down the tangent line:
\[ u \mapsto (2\cos t,\ 2\sin t,\ t) + u(-2\sin t,\ 2\cos t,\ 1), \]
that is,
\[ u \mapsto (2(\cos t - u\sin t),\ 2(\sin t + u\cos t),\ t + u). \]
Hence, at $t = 0$ we obtain
\[ u \mapsto (2, 2u, u). \]
At $t = \pi/4$, the tangent line is
\[ u \mapsto \left(\sqrt{2}(1 - u),\ \sqrt{2}(1 + u),\ \pi/4 + u\right). \]
Note: In the above solution we have opted to find the 
general form of the tangent line and then substitute to 


obtain the two special cases asked for. You may well have 
dealt with each special case individually. 
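
The general-then-substitute approach described in the note can be mirrored in code. This is a sketch only, for the helix alpha(t) = (2 cos t, 2 sin t, t) used above; `tangent_line` is an illustrative name.

```python
import math

def tangent_line(t):
    """Return the map u -> alpha(t) + u * alpha'(t) for the helix above."""
    point = (2 * math.cos(t), 2 * math.sin(t), t)
    velocity = (-2 * math.sin(t), 2 * math.cos(t), 1.0)
    return lambda u: tuple(a + u * b for a, b in zip(point, velocity))

line0 = tangent_line(0.0)            # tangent line at alpha(0)
line1 = tangent_line(math.pi / 4)    # tangent line at alpha(pi/4)

u = 0.3                              # an arbitrary parameter value
print(line0(u))   # compare with (2, 2u, u)
print(line1(u))   # compare with (sqrt(2)(1 - u), sqrt(2)(1 + u), pi/4 + u)
```

Building the line once as a function of t and then specialising avoids repeating the differentiation for each point.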


Solution 5.1 
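(a) We use the properties of $dx$, etc., that is
\[ dx(v_p) = v_1, \qquad dy(v_p) = v_2, \qquad dz(v_p) = v_3. \]
We have
\[
\begin{aligned}
(y^2\,dx)\big((1, 2, -3)_{(0,-2,1)}\big) &= y(0, -2, 1)^2\; dx(1, 2, -3) \\
&= (-2)^2(1) \\
&= 4.
\end{aligned}
\]

(b) Similarly,
\[
\begin{aligned}
(z\,dy - y\,dz)\big((1, 2, -3)_{(0,-2,1)}\big)
&= z(0, -2, 1)\,dy(1, 2, -3) - y(0, -2, 1)\,dz(1, 2, -3) \\
&= 1(2) - (-2)(-3) \\
&= -4.
\end{aligned}
\]
(c) Here,
\[
\begin{aligned}
\big((z^2 - 1)\,dx - dy + x^2\,dz\big)\big((1, 2, -3)_{(0,-2,1)}\big)
&= (1^2 - 1)(1) - (2) + 0^2(-3) \\
&= 0 - 2 + 0 \\
&= -2.
\end{aligned}
\]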
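
Evaluating a 1-form on a tangent vector is mechanical: evaluate the coefficient functions at the point of application and pair them with the vector part. A minimal sketch, with `one_form` as a hypothetical helper name:

```python
def one_form(fx, fy, fz):
    """Build the 1-form fx dx + fy dy + fz dz as a function of (v, p)."""
    def phi(v, p):
        # Coefficients are evaluated at the point p; dx, dy, dz pick out v1, v2, v3.
        return fx(*p) * v[0] + fy(*p) * v[1] + fz(*p) * v[2]
    return phi

v, p = (1, 2, -3), (0, -2, 1)   # vector part and point of application

a = one_form(lambda x, y, z: y**2, lambda x, y, z: 0, lambda x, y, z: 0)
b = one_form(lambda x, y, z: 0, lambda x, y, z: z, lambda x, y, z: -y)
c = one_form(lambda x, y, z: z**2 - 1, lambda x, y, z: -1, lambda x, y, z: x**2)

print(a(v, p), b(v, p), c(v, p))   # 4 -4 -2
```

The three values match parts (a), (b) and (c).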


Solution 5.2 


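In this solution we make heavy use of the values of $dx$, $dy$
and $dz$ on $U_1$, $U_2$ and $U_3$:
\[
\begin{aligned}
dx(U_1) &= 1, & dx(U_2) &= 0, & dx(U_3) &= 0, \\
dy(U_1) &= 0, & dy(U_2) &= 1, & dy(U_3) &= 0, \\
dz(U_1) &= 0, & dz(U_2) &= 0, & dz(U_3) &= 1.
\end{aligned}
\]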


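(a) Expanding $\phi$ and using linearity gives
\[
\begin{aligned}
\phi(V) &= (x^2\,dx - y^2\,dz)(xU_1 + yU_2 + zU_3) \\
&= x^2\,dx(xU_1 + yU_2 + zU_3) - y^2\,dz(xU_1 + yU_2 + zU_3) \\
&= x^2\big(x\,dx(U_1) + y\,dx(U_2) + z\,dx(U_3)\big) \\
&\qquad - y^2\big(x\,dz(U_1) + y\,dz(U_2) + z\,dz(U_3)\big) \quad \text{(linearity of 1-forms)} \\
&= x^2(x + 0 + 0) - y^2(0 + 0 + z) \\
&= x^3 - y^2 z.
\end{aligned}
\]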
(b) Similarly,
\[
\begin{aligned}
\phi(W) &= (x^2\,dx - y^2\,dz)\big((xy + yz)U_1 - yzU_2 - xyU_3\big) \\
&= x^2(xy + yz) - y^2(-xy) \quad \text{(method as first solution)} \\
&= xy(x^2 + xz + y^2).
\end{aligned}
\]
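(c) There is no need to start from scratch with this one;
we can use the two previous results and linearity:
\[
\begin{aligned}
\phi\!\left(\frac{1}{x}V + \frac{1}{y}W\right) &= \frac{1}{x}\phi(V) + \frac{1}{y}\phi(W) \quad \text{(linearity)} \\
&= \frac{1}{x}(x^3 - y^2 z) + \frac{1}{y}\,xy(x^2 + xz + y^2) \\
&= x^2 - \frac{y^2 z}{x} + x(x^2 + xz + y^2).
\end{aligned}
\]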
Solution 5.3 
These are all applications of the chain rule. 
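(a) Here,
\[ d(f^5) = 5f^4\,df. \]
(b) This time,
\[ d(\sqrt{f}) = \tfrac{1}{2} f^{-1/2}\,df = \frac{df}{2\sqrt{f}}. \]
(c) Finally,
\[ d(\log(1 + f^2)) = \frac{1}{1 + f^2}\,2f\,df = \frac{2f\,df}{1 + f^2}. \]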


Solution 5.4 

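It is worth noting that the three partial derivatives of
\[ (x^2 + y^2 + z^2)^{1/2} \]
will all have the same form because of the symmetry:
\[
\begin{aligned}
\frac{\partial (x^2 + y^2 + z^2)^{1/2}}{\partial x} &= \tfrac{1}{2}(x^2 + y^2 + z^2)^{-1/2}(2x) \\
&= \frac{x}{(x^2 + y^2 + z^2)^{1/2}}.
\end{aligned}
\]

Similarly,
\[
\frac{\partial (x^2 + y^2 + z^2)^{1/2}}{\partial y} = \frac{y}{(x^2 + y^2 + z^2)^{1/2}}, \qquad
\frac{\partial (x^2 + y^2 + z^2)^{1/2}}{\partial z} = \frac{z}{(x^2 + y^2 + z^2)^{1/2}}.
\]

Hence
\[ df = \frac{1}{(x^2 + y^2 + z^2)^{1/2}}\,(x\,dx + y\,dy + z\,dz). \]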
Solution 5.5 


In each case we check the two requirements: that $\phi$ be a
real-valued function on tangent vectors and that it be
linear at each point of $E^3$.

In order to express the 1-forms in terms of $dx$, $dy$ and $dz$,
we use
\[ dx(v) = v_1, \qquad dy(v) = v_2, \qquad dz(v) = v_3 \]
and the pointwise principle.
As usual, we use $dx$ etc. and $dx_1$ etc. interchangeably.


(a) This is a 1-form. First, by the definition given, $\phi$ is a
real-valued function on tangent vectors. Next,
\[
\begin{aligned}
\phi(av + bw) &= (av_1 + bw_1) - (av_3 + bw_3) \\
&= a(v_1 - v_3) + b(w_1 - w_3) \\
&= a\phi(v) + b\phi(w),
\end{aligned}
\]
so $\phi$ is linear at each point of $E^3$.
We have
\[ \phi(v) = v_1 - v_3 = dx(v) - dz(v), \]
so, by the pointwise principle,
\[ \phi = dx - dz. \]
(b) This is not a 1-form, although it is a real-valued
function on tangent vectors. It fails to be linear, as the
following counterexample shows. Let $v$ be any tangent
vector at the point $p = (1, 0, 0)$. Then
\[ \phi(v) = 1 - 0 = 1. \]

However, $\phi(2v) = 1 - 0 = 1$, since $2v$ is based at the
same point; thus
\[ \phi(2v) \neq 2\phi(v). \]


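(c) This is a 1-form. It is a real-valued function on
tangent vectors and, at each point, $p_1$ and $p_3$ are fixed so
\[
\begin{aligned}
\phi(av + bw) &= (av_1 + bw_1)p_3 + (av_2 + bw_2)p_1 \\
&= a(v_1 p_3 + v_2 p_1) + b(w_1 p_3 + w_2 p_1) \\
&= a\phi(v) + b\phi(w),
\end{aligned}
\]
so $\phi$ is linear.
Here
\[
\begin{aligned}
\phi(v) &= v_1 p_3 + v_2 p_1 \\
&= dx(v)\,z(p) + dy(v)\,x(p),
\end{aligned}
\]
so
\[ \phi = z\,dx + x\,dy. \]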


(d) This is a 1-form. By the definition, $v_p[f]$ is a real
number for any function $f$ on $E^3$, so $\phi$ is a real-valued
function on tangent vectors. Also, Theorem 3.3(1) shows
that
\[
\begin{aligned}
\phi(av + bw) &= (av + bw)[x^2 + y^2] \\
&= a\,v[x^2 + y^2] + b\,w[x^2 + y^2] \\
&= a\phi(v) + b\phi(w),
\end{aligned}
\]
and $\phi$ is linear at each point.

We first evaluate $\phi(v)$ using
\[ v_p[f] = \sum_{i=1}^{3} \frac{\partial f}{\partial x_i}(p)\,v_i \]
as
\[
\begin{aligned}
\phi(v) &= v_p[x^2 + y^2] \\
&= 2x(p)\,v_1 + 2y(p)\,v_2 + 0\,v_3 \\
&= 2x(p)\,dx(v) + 2y(p)\,dy(v).
\end{aligned}
\]
Thus
\[ \phi = 2x\,dx + 2y\,dy. \]


Note: You could simply have quoted O’Neill’s general
result that $df$ is a 1-form whose value on $v_p$ is $v_p[f]$. The
last part follows from
\[ df = \sum_{i=1}^{3} \frac{\partial f}{\partial x_i}\,dx_i. \]


(e) This is a 1-form. It is a real-valued function on
tangent vectors. Also,
\[
\begin{aligned}
\phi(av + bw) &= 0 \\
&= 0 + 0 \\
&= a\phi(v) + b\phi(w).
\end{aligned}
\]
In this case it is clear that
\[ \phi = 0\,dx + 0\,dy + 0\,dz. \]


(f) This is not a 1-form. It fails to be linear, and the 
same counterexample as used in part (b) will illustrate 
this failure. (Use any tangent vector based at (1,0,0).) 


Solution 6.1 


In the first part we make use of the distributive 
properties of the wedge product, the alternation rule and 
the ‘repeats are zero’ property. 


In the second part, we use the definition of exterior 
derivative. 


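(a) (i) We have
\[
\begin{aligned}
\phi \wedge \psi &= (yz\,dx + dz) \wedge (\sin z\,dx + \cos z\,dy) \\
&= yz\sin z\,dx\,dx + yz\cos z\,dx\,dy + \sin z\,dz\,dx + \cos z\,dz\,dy \\
&= yz\cos z\,dx\,dy - \sin z\,dx\,dz - \cos z\,dy\,dz.
\end{aligned}
\]
(ii) Here
\[
\begin{aligned}
\psi \wedge \xi &= (\sin z\,dx + \cos z\,dy) \wedge (dy + z\,dz) \\
&= \sin z\,dx\,dy + z\sin z\,dx\,dz + \cos z\,dy\,dy + z\cos z\,dy\,dz \\
&= \sin z\,dx\,dy + z\sin z\,dx\,dz + z\cos z\,dy\,dz.
\end{aligned}
\]
(iii) In this case
\[
\begin{aligned}
\xi \wedge \phi &= (dy + z\,dz) \wedge (yz\,dx + dz) \\
&= yz\,dy\,dx + dy\,dz + yz^2\,dz\,dx + z\,dz\,dz \\
&= -yz\,dx\,dy - yz^2\,dx\,dz + dy\,dz.
\end{aligned}
\]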
(b) We use the definition, which amounts to saying ‘to
find the exterior derivative, find $d$ of the coefficient
functions’.

(i) We obtain
\[
\begin{aligned}
d\phi &= d(yz\,dx + dz) \\
&= d(yz)\,dx + d(1)\,dz \\
&= (0\,dx + z\,dy + y\,dz)\,dx + 0 \\
&= z\,dy\,dx + y\,dz\,dx \\
&= -z\,dx\,dy - y\,dx\,dz.
\end{aligned}
\]

(ii) Here
\[
\begin{aligned}
d\psi &= d(\sin z\,dx + \cos z\,dy) \\
&= d(\sin z)\,dx + d(\cos z)\,dy \\
&= \cos z\,dz\,dx - \sin z\,dz\,dy \\
&= -\cos z\,dx\,dz + \sin z\,dy\,dz.
\end{aligned}
\]
(iii) In this case
\[
\begin{aligned}
d\xi &= d(dy + z\,dz) \\
&= d(1)\,dy + dz\,dz \\
&= 0.
\end{aligned}
\]
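
The wedge products in part (a) can be checked pointwise. The sketch below represents a 1-form at a single point by its triple of coefficients of (dx, dy, dz), and returns the coefficients of (dx dy, dx dz, dy dz) of the product. That ordering is a convention chosen here to match the standard expressions above, and `wedge` is an illustrative name.

```python
import math

def wedge(phi, psi):
    """Coefficients of (dx dy, dx dz, dy dz) in phi ^ psi at one point.

    phi and psi are triples of coefficients of (dx, dy, dz) at that point.
    The formula follows from the alternation rule and 'repeats are zero'.
    """
    return (phi[0] * psi[1] - phi[1] * psi[0],
            phi[0] * psi[2] - phi[2] * psi[0],
            phi[1] * psi[2] - phi[2] * psi[1])

x, y, z = 0.5, -1.2, 0.8                    # an arbitrary sample point
phi = (y * z, 0.0, 1.0)                     # yz dx + dz
psi = (math.sin(z), math.cos(z), 0.0)       # sin z dx + cos z dy
xi = (0.0, 1.0, z)                          # dy + z dz

print(wedge(phi, psi))   # compare with (yz cos z, -sin z, -cos z)
print(wedge(psi, xi))    # compare with (sin z, z sin z, z cos z)
print(wedge(xi, phi))    # compare with (-yz, -y z^2, 1)
```

Swapping the arguments of `wedge` changes the sign of every coefficient, which is the alternation rule in this representation.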


Solution 6.2 


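We calculate the three terms individually. First,
\[
\begin{aligned}
d(\phi \wedge \psi) &= d\!\left(\frac{1}{y}\,dx \wedge z\,dy\right) \\
&= d\!\left(\frac{z}{y}\,dx\,dy\right) \\
&= -\frac{z}{y^2}\,dy\,dx\,dy + \frac{1}{y}\,dz\,dx\,dy \\
&= 0 - \frac{1}{y}\,dx\,dz\,dy \\
&= -\frac{1}{y}\,(-dx\,dy\,dz) \\
&= \frac{1}{y}\,dx\,dy\,dz.
\end{aligned}
\]

Next,
\[
\begin{aligned}
d\phi \wedge \psi &= d\!\left(\frac{1}{y}\,dx\right) \wedge z\,dy \\
&= -\frac{z}{y^2}\,dy\,dx \wedge dy \\
&= 0 \quad \text{because } dy \text{ is repeated.}
\end{aligned}
\]
Finally,
\[
\begin{aligned}
\phi \wedge d\psi &= \frac{1}{y}\,dx \wedge d(z\,dy) \\
&= \frac{1}{y}\,dx \wedge dz\,dy \\
&= \frac{1}{y}\,dx\,dz\,dy \\
&= -\frac{1}{y}\,dx\,dy\,dz.
\end{aligned}
\]

Comparing these results, we see that
\[ d(\phi \wedge \psi) = d\phi \wedge \psi - \phi \wedge d\psi \]
in this case.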


Solution 6.3 


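This solution hinges on the fact that ‘mixed’ second
derivatives are independent of the order of differentiation
(at least for the well-behaved functions considered in this
course). That is,
\[ \frac{\partial}{\partial y}\!\left(\frac{\partial f}{\partial x}\right) = \frac{\partial}{\partial x}\!\left(\frac{\partial f}{\partial y}\right). \]
We know that
\[ df = \frac{\partial f}{\partial x}\,dx + \frac{\partial f}{\partial y}\,dy + \frac{\partial f}{\partial z}\,dz. \]

So, finding the differential of each coefficient,
\[
\begin{aligned}
d(df) &= \left(\frac{\partial^2 f}{\partial x^2}\,dx + \frac{\partial^2 f}{\partial y\,\partial x}\,dy + \frac{\partial^2 f}{\partial z\,\partial x}\,dz\right) dx \\
&\quad + \left(\frac{\partial^2 f}{\partial x\,\partial y}\,dx + \frac{\partial^2 f}{\partial y^2}\,dy + \frac{\partial^2 f}{\partial z\,\partial y}\,dz\right) dy \\
&\quad + \left(\frac{\partial^2 f}{\partial x\,\partial z}\,dx + \frac{\partial^2 f}{\partial y\,\partial z}\,dy + \frac{\partial^2 f}{\partial z^2}\,dz\right) dz \\
&= \left(\frac{\partial^2 f}{\partial y\,\partial x}\,dy\,dx + \frac{\partial^2 f}{\partial x\,\partial y}\,dx\,dy\right)
 + \left(\frac{\partial^2 f}{\partial z\,\partial x}\,dz\,dx + \frac{\partial^2 f}{\partial x\,\partial z}\,dx\,dz\right) \\
&\quad + \left(\frac{\partial^2 f}{\partial z\,\partial y}\,dz\,dy + \frac{\partial^2 f}{\partial y\,\partial z}\,dy\,dz\right) \\
&= 0 + 0 + 0 = 0.
\end{aligned}
\]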


The final collapse is caused by the remark before the 
solution and the alternation rule. 


Applying the above result gives
\[
\begin{aligned}
d(f\,dg) &= df \wedge dg + f\,d(dg) \quad \text{(Theorem 6.4(2))} \\
&= df \wedge dg + 0 \\
&= df \wedge dg.
\end{aligned}
\]


Solution 6.4 


In this solution we apply the various parts of 
Theorem 6.4 together with linearity and the alternation 
rule. The result from the last solution is also useful. 


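(a) We obtain
\[
\begin{aligned}
d(f\,dg + g\,df) &= df \wedge dg + f\,d(dg) + dg \wedge df + g\,d(df) \\
&= df \wedge dg + 0 + dg \wedge df + 0 \\
&= df \wedge dg + dg \wedge df \\
&= 0.
\end{aligned}
\]
Note: If you happen to recognize that
\[ f\,dg + g\,df = d(fg), \]
then the result can be obtained quickly:
\[ d(f\,dg + g\,df) = d(d(fg)) = 0. \]
(b) In this case, we have
\[
\begin{aligned}
d\big((f - g)(df + dg)\big)
&= d(f - g) \wedge (df + dg) + (f - g)\,d(df + dg) \\
&= (df - dg) \wedge (df + dg) + (f - g)\big(d(df) + d(dg)\big) \\
&= df \wedge df + df \wedge dg - dg \wedge df - dg \wedge dg + (f - g)(0 + 0) \\
&= df \wedge dg + df \wedge dg \\
&= 2\,df \wedge dg.
\end{aligned}
\]
(c) Here we apply Theorem 6.4(3), then Theorem 6.4(2):
\[
\begin{aligned}
d(f\,dg \wedge g\,df) &= d(f\,dg) \wedge (g\,df) - (f\,dg) \wedge d(g\,df) \\
&= \big(df \wedge dg + f\,d(dg)\big) \wedge (g\,df) - (f\,dg) \wedge \big(dg \wedge df + g\,d(df)\big) \\
&= (df \wedge dg + 0) \wedge (g\,df) - (f\,dg) \wedge (dg \wedge df + 0) \\
&= 0.
\end{aligned}
\]
(Each term has a repeated differential.)
(d) Here we have
\[
\begin{aligned}
d(gf\,df) + d(f\,dg)
&= d(gf) \wedge df + gf\,d(df) + df \wedge dg + f\,d(dg) \\
&= (f\,dg + g\,df) \wedge df + 0 + df \wedge dg + 0 \\
&= f\,dg \wedge df + 0 + df \wedge dg \\
&= (1 - f)\,df \wedge dg.
\end{aligned}
\]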




Solution 6.5 
Following the hint, if we start with
\[ \phi = f_1\,dx_1 + f_2\,dx_2 + f_3\,dx_3, \]
then
\[ d\phi = df_1 \wedge dx_1 + df_2 \wedge dx_2 + df_3 \wedge dx_3, \]
by the definition of exterior derivative. Now, consider the
exterior derivative of each of the terms on the right-hand
side of the last equation. For the $i$th term,
\[
\begin{aligned}
d(df_i \wedge dx_i) &= d(df_i) \wedge dx_i - df_i \wedge d(dx_i) \\
&= 0 + 0 = 0.
\end{aligned}
\]

Both terms are zero because $d(dg) = 0$ for any function $g$,
and so $d(df_i)$ and $d(dx_i)$ are both zero.
Hence $d(d\phi) = 0$.


Solution 6.6 


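Because the functions are $E^2 \to R$, we have the
differentials
\[
\begin{aligned}
df &= \frac{\partial f}{\partial x}\,dx + \frac{\partial f}{\partial y}\,dy, \\
dg &= \frac{\partial g}{\partial x}\,dx + \frac{\partial g}{\partial y}\,dy.
\end{aligned}
\]
Hence
\[
\begin{aligned}
df \wedge dg &= \left(\frac{\partial f}{\partial x}\,dx + \frac{\partial f}{\partial y}\,dy\right) \wedge \left(\frac{\partial g}{\partial x}\,dx + \frac{\partial g}{\partial y}\,dy\right) \\
&= 0 + \frac{\partial f}{\partial x}\frac{\partial g}{\partial y}\,dx\,dy + \frac{\partial f}{\partial y}\frac{\partial g}{\partial x}\,dy\,dx + 0 \\
&= \left(\frac{\partial f}{\partial x}\frac{\partial g}{\partial y} - \frac{\partial f}{\partial y}\frac{\partial g}{\partial x}\right) dx\,dy \\
&= \begin{vmatrix} \dfrac{\partial f}{\partial x} & \dfrac{\partial f}{\partial y} \\[2ex] \dfrac{\partial g}{\partial x} & \dfrac{\partial g}{\partial y} \end{vmatrix}\,dx\,dy.
\end{aligned}
\]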


Note: This result does imply the alternation rule
because $dg \wedge df$ will give rise to a determinant consisting
of the same entries as above but with the rows
interchanged. As you can easily verify, interchanging two
rows of a determinant changes the sign. Hence
\[ dg \wedge df = -df \wedge dg. \]


Solution 7.1 


We have used $x$, $y$, $z$ for the coordinate functions.
Answers expressed in terms of $x_1$ etc. are equally
acceptable.

(a) Since
\[ -3p = (-3p_1, -3p_2, -3p_3), \]
we have
\[ F = (-3x, -3y, -3z). \]
(b) Here we have 

F = (e™”,z + 2y, x°). 


Solution 7.2 
(a) If $p$ lies on the line $x = 1$, then $p = (1, p_2)$. Thus
\[ F(p) = (2p_2,\ 1 - p_2^2). \]

There are various ways of deciding what curve in $E^2$ is
represented by
\[ p_2 \mapsto (2p_2,\ 1 - p_2^2). \]

One way is to see that the coordinates $(x, y)$ of $F(p)$ are related by
\[ y = 1 - \left(\frac{x}{2}\right)^2, \]
and so the (route of the) curve is a parabola.
(b) A similar approach shows that if $p = (p_1, -1)$, then
\[ F(p) = (-2p_1,\ p_1^2 - 1), \]
so the curve has equation
\[ y = \left(\frac{-x}{2}\right)^2 - 1 = \left(\frac{x}{2}\right)^2 - 1. \]

This is also a parabola.


Solution 7.3 


(a) Suppose that $v_p$ is a tangent vector to $E^3$. Then, by
the definition,
\[ F_*(v_p) = (F(p + tv))'(0)_{F(p)}. \]
Now,
\[
\begin{aligned}
F(p + tv) &= F(p_1 + tv_1,\ p_2 + tv_2,\ p_3 + tv_3) \\
&= \big((p_1 + tv_1) - (p_2 + tv_2),\ (p_1 + tv_1) + (p_2 + tv_2),\ 2(p_3 + tv_3)\big) \\
&= \big((p_1 - p_2) + t(v_1 - v_2),\ (p_1 + p_2) + t(v_1 + v_2),\ 2p_3 + 2tv_3\big),
\end{aligned}
\]
so
\[
\begin{aligned}
(F(p + tv))' &= (v_1 - v_2,\ v_1 + v_2,\ 2v_3), \\
(F(p + tv))'(0) &= (v_1 - v_2,\ v_1 + v_2,\ 2v_3), \\
F(p) &= (p_1 - p_2,\ p_1 + p_2,\ 2p_3).
\end{aligned}
\]
Hence
\[ F_*(v_p) = (v_1 - v_2,\ v_1 + v_2,\ 2v_3)_{(p_1 - p_2,\ p_1 + p_2,\ 2p_3)}. \]

Note that in this case we also have
\[ F_*(v_p) = F(v)_{F(p)}. \]


(b) Using the standard notation for components of $p$ and
$q$, we have
\[
\begin{aligned}
F(ap + bq) &= F(ap_1 + bq_1,\ ap_2 + bq_2,\ ap_3 + bq_3) \\
&= \big((ap_1 + bq_1) - (ap_2 + bq_2),\ (ap_1 + bq_1) + (ap_2 + bq_2),\ 2(ap_3 + bq_3)\big) \\
&= \big(a(p_1 - p_2) + b(q_1 - q_2),\ a(p_1 + p_2) + b(q_1 + q_2),\ a\,2p_3 + b\,2q_3\big) \\
&= a(p_1 - p_2,\ p_1 + p_2,\ 2p_3) + b(q_1 - q_2,\ q_1 + q_2,\ 2q_3) \\
&= aF(p) + bF(q).
\end{aligned}
\]


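(c) We apply the definition of $G_*$ and the information
that $G$ is linear:
\[
\begin{aligned}
G_*(v_p) &= (G(p + tv))'(0)_{G(p)} \\
&= (G(p) + tG(v))'(0)_{G(p)} \quad \text{(linearity)} \\
&= (0 + G(v))(0)_{G(p)} \quad \text{(differentiating w.r.t. } t\text{)} \\
&= G(v)_{G(p)}.
\end{aligned}
\]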
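
For the linear mapping of part (a), the conclusion that the vector part of F_*(v_p) equals F(v) can be checked numerically. The sketch assumes F = (x - y, x + y, 2z), as in the working above, and approximates the derivative of t -> F(p + tv) at t = 0 by a central difference. `derivative_map` is an illustrative name.

```python
def F(p):
    """The mapping F = (x - y, x + y, 2z)."""
    x, y, z = p
    return (x - y, x + y, 2 * z)

def derivative_map(F, p, v, eps=1e-6):
    """Estimate the vector part of F_*(v_p) as d/dt F(p + t v) at t = 0."""
    fwd = F(tuple(a + eps * b for a, b in zip(p, v)))
    bwd = F(tuple(a - eps * b for a, b in zip(p, v)))
    return tuple((s - r) / (2 * eps) for s, r in zip(fwd, bwd))

p, v = (1.0, 2.0, -1.0), (0.5, -3.0, 4.0)   # arbitrary point and vector part
estimate = derivative_map(F, p, v)
print(estimate)   # should be close to F(v)
print(F(v))       # (3.5, -2.5, 8.0)
```

The estimate does not depend on p, as expected for a linear mapping; only the point of application F(p) does.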


Solution 7.4 
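(a) Since
\[ \frac{\partial (x + y)}{\partial x} = 1, \]
etc., we have that the Jacobian for $F$ is
\[ \begin{pmatrix} 1 & 1 & 0 \\ 1 & -2 & 0 \\ 1 & 1 & 1 \end{pmatrix}. \]
(b) Similarly, the Jacobian for $G$ is
\[ \begin{pmatrix} 2x & 2y & 0 \\ 0 & 2y & 2z \end{pmatrix}. \]
Evaluating this at $F$ gives
\[ \begin{pmatrix} 2(x + y) & 2(x - 2y) & 0 \\ 0 & 2(x - 2y) & 2(x + y + z) \end{pmatrix}, \]
since $x(F) = x(x + y,\ x - 2y,\ x + y + z) = x + y$, etc.

(c) Combining the last two parts gives the Jacobian for
$G \circ F$ as
\[
\begin{pmatrix} 2(x + y) & 2(x - 2y) & 0 \\ 0 & 2(x - 2y) & 2(x + y + z) \end{pmatrix}
\begin{pmatrix} 1 & 1 & 0 \\ 1 & -2 & 0 \\ 1 & 1 & 1 \end{pmatrix}
= \begin{pmatrix} 2(2x - y) & 2(-x + 5y) & 0 \\ 2(2x - y + z) & 2(-x + 5y + z) & 2(x + y + z) \end{pmatrix}.
\]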
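
The matrix product in part (c) can be verified at a sample point. The sketch assumes F = (x + y, x - 2y, x + y + z), and takes the Jacobian of G exactly as stated in part (b). All function names are illustrative.

```python
def F(x, y, z):
    return (x + y, x - 2 * y, x + y + z)

def jac_F(x, y, z):
    """Jacobian of F (constant, since F is linear)."""
    return [[1, 1, 0], [1, -2, 0], [1, 1, 1]]

def jac_G(x, y, z):
    """Jacobian of G as given in part (b)."""
    return [[2 * x, 2 * y, 0], [0, 2 * y, 2 * z]]

def matmul(A, B):
    """Plain matrix product of two lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def jac_GF(x, y, z):
    """Chain rule: (Jacobian of G evaluated at F(p)) times (Jacobian of F at p)."""
    return matmul(jac_G(*F(x, y, z)), jac_F(x, y, z))

def closed_form(x, y, z):
    """The final matrix from part (c)."""
    return [[2 * (2 * x - y), 2 * (-x + 5 * y), 0],
            [2 * (2 * x - y + z), 2 * (-x + 5 * y + z), 2 * (x + y + z)]]

print(jac_GF(1, 2, 3))        # [[0, 18, 0], [6, 24, 12]]
print(closed_form(1, 2, 3))   # [[0, 18, 0], [6, 24, 12]]
```

The order of the factors matters: the 2 x 3 Jacobian of G, evaluated at F(p), multiplies the 3 x 3 Jacobian of F on the left.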




